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EXPRESSMWLNO. 

KINASES AND PHOSFHi^TASES 



TECHNICAL FIELD 

The invention relates to novel nuddc acids, kinases and phosphatases mcoded by Hiese 
5 * nucldc acids, and to llieiise of these nncldc acids and pnite^ 

prev^on of cafdio vascular diseases, immune systmidisoid^, neoiological disorders, disorders 
affecting growth and developmeot, lipid disorders, cell prolifiarative disorders, and cancos. Hie 
invention also rdates to flie assessm^ of Oie Elects of exogenous conqMiands on ttie expression of 
nucleic acids and Idnases and phosphatases. 

10 

BACKGROUND OF THE INVENTION 
Reversible protdn phosphorylation is Oe ubiquitous strategy used to control many of the 
intracdhdar events in eukaryotic cells. It is estimated that more than ten p^x:eiDt of proldns active in 
a typical noanumlian cell are phosphorylated. Kinases catalyze the transfix of higti-eoe^phosidm^ 

15 groups from adenosine triphosphate (ATP) to target protdns on the hydroxy amino acid residues 
serine, threonine, or tyrosine. Phosphatases, in contrast, remove these phosphate groups. 
BsctraceUolar signals induding hormones, neurotransmitters, and growth and differentiation factors 
can activate Idnases, which can occur as cell surface receptors or as the activator of the final effector 
protein, as well as other locations, along die signal transduction pathway. Cascades of Idnases occur, 

20 as well as Idnases sensitive to second messenger nx)leciiles. This syst^n allows for the aixyplification 
of weak signals (low abundance growth factor ooolecules, for example), as well as the synthesis of 
many weak signals into an all-or-notiiing response. Phosphatases, then, are essential in detemodning 
the extent of phosphorylation in the cell and, together with kinases, regulate key cellular processes 
such as metabolic enzyme activity, proliferation, cell growth and dififerentiation, cell adhesion, and 

25 cell cycle progression. 
KINASES 

Kinases conQ>rise the largest known enzyme superfanuly and vary widely in their target 
molecules. Kinases catalyze the transfer of high mergy phosphate groups from a phosphate donor to 
a phosphate accepton^ Nuclratides.usuaUy sf^e as-the phos wbix most 

30 kinases utiliadng adraiosine triphosphate (ATP). The phosphate acceptor can he any of a variety of 
molecules, including nucleosides, nucleotides, lipids, carbohydrates, and protdns. Proteins are 
phosphoiylated on hydroxyamino adds. Addition of a phosphate group eiUm the local diarge on tihe 
acceptor molecule, causing internal confonmtional dianges and potentially iniluendng 
intmnolecular contacts. Reversible protein phosphorylation is the prirnarymelhod for regqlati^ 

35 protdnactivi^ in eukaryotic cdDs. In general, proteins are activated by phosphorylation in response 
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to CTtraceDidar signals such as honncmes, neunitraiisaDtters, and growth and differentiation factors. 
Tlie activated protdns initiate tiie cdl's intracdMar response by way of intracellular signaling 
patiiways and second messei^eff molecules such as cyclic nucleotides, calchun-cahnodolin, inositol, 
and various nmtogans, tiiat r^;ulate protein phospihorj^tion. 

S Kinases are involved in aU aspects of a cell's fiinction, frombasic naetabolic processes, such 

as glycolysis, to cell-cycle regplation, diffisr^idation, and comniunication vnOi the extracellular 
^viionsent through signal transduction cascades. Inappropriate phosphor^tion of protdns in ceDs 
has been linked to dianges in ce9 cycle progression and odldiffer^^ Qiapges in die cdl cycle 
have bem linked to induction of apoptDsis or canctt". CSianges in ceffidiffi»raitiation have been linloBd 

10 to diseases and disorders of the rqmductive systar^ immuTie system, and skeletal muscle. 

Thm are two classes of protedn Idnases. One class, protein tyrosine kinases (PTKs), 
phosphorylates ^rosine residues, and the other dass, protein serine/threonine Idnases (STKsX 
phosphorylates serine and threonine residues. Some PTKs and STKs possess structural 
characteristics of both families and have dual specificity for both tyrosine and serine/tiireonine 

IS residues. Almost aU Idnases contain a conserved 250-300 amino acid catalytic domain contain^ 
specific residues and sequence ixx>ti& characteristic of die kinase fannly. The protein kinase catalytic 
domain can be forther divided into 1 1 subdomains. N-t^ndnal subdomains I-IV fold into a two«lobed 
structure whidi binds and orients the ATP donor molecule, and subdomain V spans die two lobes. C- 
tenrinal subdomains VI-XI bind the protein substrate and transfer the gamma phosphate from ATP to 

20 the hydroxyl group of a tyrosine, serine, or threonine residue. Each of the 1 1 subdomains contains 
specific catalytic residues or amino acid motifs characteristic of that subdomain. For exaiiple, 
subdonrain I contains an 8-andno acid glycine-rich ATP binding consensus motif, subdomain 11 
contains a critical lysixie residue required for nnaximal catalytic activity, and subdomains VI through 
DC con^iise the highly conserved catalytic core. PTKs and STKs also contain distinct sequence 

25 motifs in subdomains VI and VIII which may confer hydroxyamino add specificity. 

In addition, Idnases may also be classified by additional annuo add sequences, generally 
b^em Sand 100 residues, \vhicheilliiOT flank or occur ^1h^ These additional 

amino acid sequences regulate kinase activity and detemmne substrate spedfidty. Qleviewed in 
Hardie, G. and S. Ifariks (1995^ The Protein Kma5ge PactSrBoofc rVohL pp: 17-20 Aciul^c Vie^, ' 

30 San Diego CA.). In particular, two protdn kinase signature sequraces have been identified in the 
kinase domain, die first containing an active site lysine reddue involved in ATP binding, and the 
second containing an aspartate residue inoportant for catalytic activi^. If a protdnanatyzed includes 
the two protdn kinase signatures, the probabili^ of that protein bdng a protdn kinase is close to 
100% (PROSrrB: PDOCOOlOO, NovCTob^ 199S). 
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Protein Tyrosine Kinases 

Protean ^losiiiBldiiases CPTKs) may be classified as mtheEr transmranbrane, xecqitDr PTKs or 
noiilnmsmembraiie, ]uxQrBcq>to^ Transnraibraiietyrosiiieldnases Au^ 

lecqytois fin: ooost growCh factors. Gio^vth factors bind to Aeiecqytor tyrosine 

S causes die receptor to phospboiylato itself (autopbosphoiylation) and spedfic intraceaMar second 
n£S5CTger proteuis. Growth factors (GF) diat assodato widi recqptor PTKs include epidermal GF, 
platelet-derived GF, fibroblast GF» bepatocyte GF, insulin and insolinrlike GFs, nerve GF, vascular 
endoflidial GF, and maccpifliage coloi^ stimulating factor. 

Nontransmembrane, nonreceptor PTKs lack transniCTibranB re^ons and, instead, form 

10 signaling conq>lexes widi the cytosoUc domains of plasma menibran Receptors diat 

function through non-receptor PTKs include those for cytokines and hormones (growth hormone and 
prolactin), and antigen-specific receptors on T and B lyn^hocytes. 

Many PTKs were first identified as oncogene products in cancer cells in which PTK 
activation was no longer subject to normal cellular controls. In fact, about one third of the known 

15 oncog^aes encode PTKs. Furdiermore, ceUular transformation (oncogmesis) is oftesn acconqpanied 
by increased tyrosine phosphorylation activity (Charbonneau, H. and N.K. Tonks (1992) Annu. Rev. 
Cell Biol. 8:463-493). Regulation of PTK activity imy therefore be an important strategy in 
controlling some types of cancer. 
Protein Serine/Threonine Kinases 

20 Protein serine/threonine kinases (STKs) are nontransmenobrane proteins. A subclass of STKs 

are known as ERKs (extracellular signal regulated kinases) or MAPs (nitogen-activated protran 
kinases) and are activated after ceQ stimulation by a variety of hormones and growth fitctors. Cdl 
stimulation induces a signaling cascade leading to phosphorylation of MEK (MAP/ERK Idnase) 
which, in turn, activates ERK via serine and tiireonine phosphorylation. A varied nninber of proteins 

25 x&presessX the downstream ^Sectors for the active ERK and implicate it in tiie control of ceD 
proliferation and difi^CTtiation, as weU as regidation of the cytoskeleton. Activaticmof ERKis 
nonnally transient, and cells possess dual specificity phosphatases that are responsible for its down- 
relation. Also, numinous stodies have shown that elevated ERK activity is associated witii some 
cancers. Otiier STKs include die jecond nsBSsengec-dependent protdua ldnases'SUch as the 

30 cycUc-AMP depmdent protdn kinases (PKA), caldun^calmodolin (CaM) dependent protein kinases, 
and the nutog^-activated protein kinases (MAP); the cyclinrdqpendmt protein kinases; checl^int 
and cdl cyde kinases; Nunob-associated kinase ^ak); human F^ised (hPuy, proliferation-related ' 
kinases; S'*-AMP-activated protdn kinases; and Idnases involved in apoptosis. 

One meniber of die ERK fiuxnly of MAP Idnases, ERK 7, is a novel 61-kDa protein that has 

35 motif similarities to ERKl and ERK2, but is not activated by extracellular stimiili as are ERKl and 
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ERK2 nor by tbeconmiDD activators, o-JunN^ ERK7 regulates 

its nuclear locafizaticm and inhibitioni of growth fbroagh its C-tenxonal tail, not diron]^ flie kinase 
domain as is Qnpical wifli odier MAP Idnases (Abe, M.K. (1999) Mot CfSL BioL 19:1301-1312). 
The second messei^er depends prot^ kinases primarily mediate flie efSects of second 

S messengos such as cyclic AMP (cAMP), cyclic GMP, inositol tripbosphate, pihospbatidylinosilol, 
^ 3»4,S-triplLospliate, cyclic ADP ribose, arachidonic acid, diacylgiycmd and caldun>ca]mDdi]lin. Hie 
PKAs are involved in mediatsiig hormone-induced ceUolar responses and are activatBd by c AMP 
produced within the ceU in response to bonixme stimulation. cAMP is an intracellular mediator of 
hormone action in all animal ciells tbat have been studied. Hormone-induced cellular responses 

10 include thyroid hormone secretion, coitisol secretion, progesterone secretion, glycogen breakdown, 
bone resorption, and regulation of heait rate and force of heart muscle contraction. PKA is found in 
all animal cells and is tiiought to account for the effects of c AMP in most of these cells. Altered PKA 
expression is inq>licatBd in a variety of disorders and diseases including cancer, tiiyroid disorders, 
diabetes, atiieiosclerosis, and cardiovascular disease (Isselbach^, K.J. et al. (1994) Harrison's 

IS Principles of Internal Medicine ^ McGraw-Hill, New Yoik NY, pp. 416-431, 1887). 

The casdnldnasel (CKI) gmefomily is anoth^ subfonnly of smne/tiireoxune protein 
kinases. This continuously oKpanding group of Idnases have been itqplicated in tiie regulation of 
num^ous cytoplasmic and nuclear processes, induding cell metabolism and DNA replication and 
repair. CKI enzymes are present in Ae mmbranes, nucleus, cytoplasm and cytosloeleton of 

20 eukaiyotic ceDs, and on the nmtotic spindles of mammalian ceils (Pish, KJ. ^ al. (1995) J. Biol. 
Chem. 270:14875-14883). 

The CKI fannly members all have a short amino-temnnal domain of 9-76 andno acids, a 
hig^y conserved kinase domain of 284 amino acids, and a variable carbox^-temnnal domain that 
ranges firom 24 to over 200 annno adds in length (Cegielska, A. et al. (1998) J. BioL Chem. 

25 273:1357-1364). The CKI family is comprised of highly related proteins, as seen by the identification 
of isofoims of casdbi kinase I from a variety of sources. There are at least five mammalian isoforms, 
a, p, Y> S\ and 8. Fish et al. identified CKI-epsilon from a human placenta cDNA library. It is a basic 
protein of 416 annno acids and is closest to CKI-delta. Through recombinant expression, it was 
det^mined to phosphorylate known CKI substrates and was inluBiia l^'ffielZKi-^pecific inhibitor 

30 CKI-7. The human gene for CKI-^psilon was able to rescue yeast with a slow-growth phenotype 
caused by deletion of the yeast CKI locus, HRI1250 (Fish et al,, supra). 

The mammalian circadian mutation tan was found to be a semidominant autosomal allele of 
CKI-epsUon that markedly shortens period length of circadian ifaythms in Syrian hamsters. The tau 
locus is encoded by casein kinase I-^silon, which is also a homolog of the Drosophila drcadian gene 

35 double-time. Studies of both the wildtype and tau mutant CKI-epsilon enzyme indicated that die 
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mutant enzyme has a noticeable reduction in the n^ixtmam vdoci^ and autophosphoiylation state. 
Farther, in vitro, CKI-epsilon is able to interact with nwmmalian PERIOD proteins, while the mutant 
enzyme is deficient in its ability to phosphorylate PERIOD. Lowiey et aL have proposed fliat CKI- 
epsilon plays a wsqoi role in delaying the negative feedback signal wifliin the transc^iption- 
5 transktion-basedautoregidatory loop that con[qK>sestiieco!re of Aeciic^ Theiefoie 
the CKI<^psilon en2ryme is an ideal target for pharmaceutical con^ 

rhy flimjg , jet-lag and sleeps in addition to otii^ physiologic and metabolic processes under ciicadian 
regulation (Lowiey, P.L. et aL (2000) Sdence 288:483-491). 

Homeodomain-interacting protein Idnases (HIPKs) are serine/dueonine kinases and novel 

10 meihbeis of die DYRK kinase subfamily CHofinann» T.G. et aL (2000) Biochimie 82: 1 123-1 127). 
HDPKs contain a conserved protein kinase domain separated from a domain tinat interacts wilb 
homeopmteins. HIPKs axe nuclear kinases, and HIPK2 is highly expressed in neuronal tissue QQm, 
Y.H. et id, (1998) J. BioL Chem. 273:25875-25879; Wang. Y. et al. (2001) Biochim. Biophys. Acta 
1518:168-172). HIPKs act as corepressors for honaeodonoian transcription factors. This corepressor 

15 activity is seen in posttranslational modifications such as ubiquitination and phosphorylation, each of 
which ate important in the regulation of ceUular protdm function (Kom, Y.H. et al. (1999) Pnic. Natl. 
Acad. Sci. USA 96:12350-12355). 

The human h-warts protdn, a homolog of Drosophila warts tumor suppressor g^oe^ maps to 
chromosome 6q24-25.1. Ithasaserine/threoniiiBkiiiasedomainandislocdizedtoceEitros 

20 inteaphase cells. It is involved in iidtosis and functions as a conqioneot of th^ 
(Nishiyama, Y. et aL (1999) FEBS Lett. 459:159-165). 
Caldum-Calmodulln Dependent Protein Kinases 

CalciuDVcalmodulin dependent (CaM) kinases are involved in regulation of smooth muscle 
contraction, glycogen breakdown (phosphorylase kinase), and neurotransmis^on (CaM kinase I and 

25 CaMkinasell). CaM dqpeaident protein kinases are activated by calmodulin, an intracellular calcium 
recqitor, in response to the concmtration of free caldum in the cei^ Many CaM kinases are also 
activated by phosphoiylati<»DL Some CaM kinases are also activated by autophosphorylation or by 

oth«' regulatory Idnases. CaMldnaselifliosphoiylatesavarielyiof substrates includi^ 

neurolraiisnutter^rdated protdns synapsin'I andli;'the gine transcription regulator, CREB, and die 

30 cystic fibrosis conductance regulator protean, CFTR (Haribabu, B. et al* (1995) EMBO J. 14:3679- 
3686). CaM kinase n also phosphorylates synapsin at different sites and controls the synthesis of 
catecholannnes in the brain tiirough phosphorylation and activation of tyrosine hydroxylase. CaM 
kinase n controls the synthesis of catecholamines and seratonin, through phosphorylation/activation 
of tyrosine hydroxylase and tiyptophan hydroxylase, respectively (Fujisawa, H. (1990) BioEssays 

35 12:27-29)* The mRNA encodmg a calmodulin-binding protein kinase-like protdn was found to be 
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mridiedminainmaUanforebruD. Tbis protein is associated 'imAveddesm 
dendiitBS and accunndates largdy postnatally. The amino add sequence of this prot^ is sinnlar to 
CaM-dq>«dent STKs, and fhe pioldnlnnds calmndnlininflie piesrace of calicianiCGodbout, M. ^ 
al. (1994) J. Nemosd. 14:1-13). 
5 »BtogM>Acttvated Protein Kinases . 

The nntogm-activAted pxotdoi kinases (MAF)» whidi noediate signal transduction ficom Ibe 
cell surface to the nucleus via pbospboiylation cascades, are another STK family that regulates 
intraceOnlar signaling pathway. Sevimd subgroups have heen idratified» and eadh maiufests 
diffefeot substrate spedfidttes and req[ionds to distinct atra 
10 Wdnbeig (1993) Natnie 365:781-783). Tbm are three kinase noodnles conqprising the MAP kinase 
cascade: MAPK (MAP), MAPK kinase (MAP2K, MAPKK, or MKK), and MKK Mnase (MAP3K, 
MAPKKK, OR MEKK) (Wang^S- et al (1998) Biochem. Biophys. Res. Comnnin. 253:33-37), The 
extracellular-regulated kinase (ERK) pathway is activated by growth factors and ndtog^, for 
exan^le, epidermal growth factor (EGF), ultraviolet light, hyperosmolar medium, heat shodk, or 
IS endotoxiclipopolysaccharide(LPS). The closdLy related though distinct parallel pathways, the c-Jun 
N-teminal kinase (JNK), or stress-activated kinase (SAPIO pattiway, and the p38 kinase pathway are 
activated by stress stimuli and proinflammatory cytokines such as tumor necrosis fsictor (TNF) and 
interleukin-1 (DL-l). Altered MAP kinase expression is inq^licated in a variety of disease conditions 
including cancer, inflammation, imiTntne disorders, and disorders affbcting growdi and development.. 
20 Map kinase signaling pathways are present in maTmnalian cells as well as in yeast 
Cvdin'Oenendent Protein Kinases 

The cyclinnJependent protein kinases (CDKs) are STKs that control the progression of cells 
through the cell cycle. The entry and exit of a cell from mitosis are regulated by the synthesis and 
destructionof a family of activating proteins called c^clins. Cydins are small regulatory protdns tihat 
25 bind to and activate CDKs» which then phosphoiylate and activate selected protdns involved in the 
mLtotic process. CDKs are niuqoe in that they require multiple inputs to become activated. In 
addition to cydin binding, CDK activation requires the i^osphoiylation of a specific threonine 
residue and the dephosjAiorylation of a specific tyrosine residue on the CDK. 

Another family of STKs associated mih the cdl-cycle-are the NIMA'(nevef in xin^ 
30 related kinases Q^eks). Both CDKs and Neks are involved in duplication^ maturation* and sq>aration 
of the nmcrotubule organizing cmt^, the ceotroscmie, in animal cells ^ly, A.M.^aL (1998) EMBO 
J. 17:470-481). 

Chcdroolnt and Cell Cvde Kinases 

In flie process of cell division, the ord^ and tinmig of ceill cycle transitions are under control 
35 of cell cycle checl^ints, which «isure that critical evrats such as DNA replication and cbromosonoe 
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segregation ate carded oat witti precision. If DNA is daim^d, e.g. by radiation* a cbeclqiinitt 
palbway is activated Oat arrests the cdllc^de to provide tiw If the damage is ^tensive, 

apoptosis is induced. In the absence of such dieckpoints, tiie damaged DNA is inbm 
cells ^^cb may cause proliferative disorders suc^ as cancer. Protdn Idnases play an inqportant role 

S in tiiis process. For example, a specific kinase, checkpoint kinase 1 (Cbkl), bas been id^itified in 
yeast and mammals* and is afativated by DNA damage in yeast ActivationofCbkl leads to tiie arrest 
of die cell at the G2/M transition (Sanc3i^, Y. et aL (1997) Sdmce 277:1497-1501). Specificalty, 
C3ikl phospboiylates the cell division cycle phospbatase CDC2S» inhibiting its normsQ fimction whidi 
is to depbosphor^ate and activate die cydin-dep^ident kinase Cdc2. Cdc2 activation controls fbe 

10 entry of cells into nntosis (Peng, C.-Y. et aL (1997) Science 277:1501-1505). Thus, activation of 
Chkl prevents the damaged ceD from entering nrntosis. A deficiency in a chec]q>oint kinase, such as 
Chkl , may also contribute to cancer by failure to anest cells with damaged DN A at other cbedqioints 
such as G2/M. 

Proliferation-Related Kinases 

15 Proliferatioiv-related kinase is a serum/cytokine inducible STK that is involved in regulation 

of the cell cycle and cell proliferation in human megakaxocytic ceils (li, B. et al. (1996) J. BioL 
CSiem. 27 1 : 19402-19408). ftoliferation-related kinase is related to the polo (derived from 
Drosophila polo gene) family of STKs iiiq>licated in cell division. Proliferation-related kinase is 
downregulated in lung tumor tissue and may be a proto-oncogene whose deregulated expression in 

20 normal tissue leads to oncogenic transfonmtion. 
S^-AMP'activatcd protein kinase 

A ligand-activated STK protein kinase is 5 -AMP-activated protein kinase (AMPK) (Gao, G. 
et al. (1996) J. Biol Chem. 271 :8675-8681). Mammalian AMPK is a regulator of fatty acid and sterol 
synthesis through phosphorylation of the enzymes acetyl-CoA carboxylase and 

25 hydioxymsthylgjutaiyl-CoA reductase and mediates responses of th^ pathways to celhilar stresses 
such as heat shock and depl^bn of ^ucose and ATP. AMPK is a heteiotrimmc complex concprised 
of a catalytic alpha subunit and two nonrcatalytic beta and gamma subunits diat are believed to 
regulate the activity of the alpha subumt. Subunits of AMPK have a much wider distribution in 
tton-lipogenic tissues such as^brain, heart, sple^ and:lung-than mpectedr Thisidisti^^ ' 

30 duit its role may extend beyond regulation of lipid metabolism alone. 
Kinases inAnoptosis 

Apoptosis is a hig|Uy regulated signaling pathway leading to cell death that plays a crucial 
role in tissue development and homeostasis. Dmgulation of this process is associated with the 
pathogenesis of a number of diseases including autoimmune diseases, neurodegenerative disorders, 
35 andcancer. Various STKs play key roles in this process. ZIP Idnase is an STK containii^ a 
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C-temrinal leucine zipper domain in aMtion to its N-ten^ TUs / 

C-teminal domain appears to mediate homodimerization and activation of the kinase as wdl as 
interacdoDS with transcription fectois such as activating transcription fector. ATF4. a memba- of fl» 
cydic-AMP responsive dement binding protein (ATF/CREB) fanfly of transcriptional factors 
(Sanjo. H. et aL (1998) J. BioL Chem. 273:29066-29071). DRAKl and DRAK2 are STKs that share 
homology with the death-associated protein Idnases (DAP Idnases). known to function in interfeon-Y 
induced apoptosis (Sanjo etal..s«;,r«). like ZIP kinase. DAP kinases contain a C-teiminal 
protein-protdn interaction domain, in Ihe form of ankyrin repeats, in addition to the N-terminal 
kinasedomain. ZIP. DAP. and DRAK kinases induce morphological changes assodated with ' 
apoptosisva«ntransfected into NIH3T3 cells (Saigoetal..5«^^^^^ However, deletion of dflxer the 
N-terminal kinase catalytic domain or the C-teiminal domain of these proteins abolishes apoptosis 
activity, indicatfa^ that in addition to the kinase activity, activity in the C-teiminal.domain is also 
necessary for apoptosis. possibly as an interacting domain with a regulator or a specific substrate. 

RICK is another STK recently idotified as mediating a specific apoptotic pathway involving 
Ihe death rec<^. CD95 (Jnohara. N. et aL (1998) J. Biol. Chem. 273:12296-12300). CD95 is a 
mcanb^ of die tumor necrosis factor receptor supiafamily and plays a critical lole in the regulation 
and homBOStasis of ae hnmune systemCNagata. S. (1997) Cell 88:355-365). The CD95 receptor 
signaling paaway involves recruitment of various intracdhdar molecules to a receptor complex 
fifllowingligandbinding. TWs process indndes recruitment of fliecystdne protease caspase-S 
v*idi, in turn, activates a caspase cascade leading to cdl death. RICKis cornposedof anN-temrinal 
kinase catalytic domain andaC-teiminal«caspase.rBcruitment'' domain ftatinteracte wifli 
caspase.likedomains.indicatinglhatRICKiflay5aioIeinaerecruitmentofca^ This 
interpretation is supported tqr the fact that tte expression of RICK in human 293Tcdls promotes 
activation of caspase^ and potentiates Oe induction of apoptosis by various protdns involved in ttie 
25 CD95 apoptosis pathway (InoharartaL,ji9?ra). 
Mitochondrial Protgin f^f n^cfc 

A novd class of eukaiyolic kinases, rdated by sequence to prokaryolic histidine protdn 
kinases, the mitochondrial protein kinases (MPKs) ^discern to have no sequence similarity 
wthjither.eukaiyotic protdn kinases. lliesepi»teinildnasesTtt&lb^ater*sc^ - — " 

mitochondrial matrix space and may have evolved from genes origimOly presoit in lespiiation. 
dq)endentbacteriawhidiwBreendocytosedbyprimitiveeukaiyoticcdls. MPKs are responsible for 
phosphorylation and inactivation of the brandied-diain ali*a-ketoadd ddiydiogenase and pyruvate 
ddiydrogenase complexes (Harris, R.A et al. (1995) Adv. Enzyme ReguL 34:147-162). Rve MPKs 
have been identified. Four members correspond to pyravate dAydrogenase kinase isozymes, 
regulating theactivi^ of the pymvateddiydiogenasecomiaex. whidi is an important rt^iy 
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razyxne at tiie interface between glycoly^ The fifUiniBnib^ corresponds to 

a branched-chain alpha-ketoacid ddiydrog^iiase kinase, important in Hie r^;iilation of the pathway for 
die disposal of brandied<hain annno adds, ^iarris, ILA. et al. (1997) Adv. Enzyme Regal. 37:271- 
293). Both starvation and the diabedc state are known to result in a great inm 
S die pyruvate dehydrogenase Idnase in the Uvcf, heart and muscle of the rat This increase contribntes 
in both disease states to the phosphorylation and inacdvalion of the pyruvate deirydrogenase con^lex 
and conservation of pyruvate and lactate for ghiconeogenesis (Harris (1995) supra). 

KINASES WITH NON-PROTEIN SUBSTRATES 

10 Lipid and Inositol kinases 

Lipid kmases phosphorylate hydros^l residues on li^d head groups. A fiEuid^ of kinases 
involved in phosphojylation of phosphatidylinositol (PI) has bem described, each member 
phosphorylating a specific carbon on the inositol ring (Leevers, S J. et aL (1999) Curr. Opin. CelL 
BioL 11:219-225). Hiephosphorylationof phosphatidylinositol is involved in activation of the 

IS protdn kinase C signaling pathway. The inositol phospholipids (phosphoinositides) intracellular 
signaling padiway begins with bindiag of a signaling molecule to a G-protdn linked receptor in the 
plasma membrane, lliis leads to the phosphorylation of phosphatidylinositol (PI) residues on the 
inner side of the plasma membrane by inositol kinases, thus converting PI residues to thebiphosphate 
state (PIP2). PIP2 is then cleaved into inositol triphosphate (IP3) and diacylglycerol. These two 

20 products act as mediators for separate signaling pathways. Cellular responses diat are mediated by 
these pathways are glycogen breakdown in the liver in response to vasopressin, smooth muscle 
contraction in response to acetylcholine, and thronohin-induced platelet aggregation. 

PI 3-kinase (PI3K), which phosphorylates the D3 position of PI and its derivative, has a 
central role in growth factor signal cascade involved in cell growth, differentiation, and metabolism. 

25 PI3K is a heterodimer consisting of an adapter sabunit and a catalytic subunit The adapts subunit 
acts as a scaffoldiiig protein, interacting with specific tyrosine-phosphorylated protdns, lipid 
moieties, and other cytosolic factors. When the adapter subunit binds tyrosine phosphorylated 
tar^^ts, such as the insulin responsive substrate (IRS)-l, the catatytic subunit is activated and converts 
PI (4,S) bisphosphate (PIP^ to PI (3»4,S) P3 (PIP3). PIPs dien activates -a-nunob^ of i:>tB^ protrau^, 

30 including PKA, prot^ Idnase B (PKB), protein kinase C (PKC), glycog^ synthase Idnase (GSK)-3, 
and p70 ribosomal s6 kinase. PBK also interacts direcOy ^th the cytoskeletal oiganizing proteins, 
Rac, rho, and cdc42 (Shepherd, P.R. et al. (1998) Biochem. J. 333:471-490). Animal models fat 
diabetes, sudi as obese zsAfat nmce, have altered PI3K adapter subunit levels. Spedfic mutations in 
the adapter subunit have also been found in an insulin*resistant Danish population, suggestinig a role 

35 for PI3K in type-2 diabetes (Shepard, supra). 
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An example of Upid kinase phosphoiylation activity is the phosphoiylation of 
D-«ythro.sphingosine to tbe spWngolipid metabolite. spMngosine-l -phosphate (SPP). SPP has 
anfaged as a novel lipid second-messoiger with both extracelhdar and intraceUular actions (Kohama. 
T. et al. (1998) J. BioL Chem. 273:23722-23728). Extracdhilarly, SPP is a Hgand for the G-protdn' 
5 coupled receptor EDG-l (endofhelial-derived, G-protein coupled receptor). IntraceBularly. SPP 
regulates cdl growth, survival, motiUty. and cytoskeletal changes. SPP levels are regulated by 
spWflgosineldnaseslhatspecificallyphosphoiylateD-er^ The importance of 

sphingosine kinase in cell signaling is indicated by flie feet that various stimuU. including 

pktdetHierived growlh factor (PDGR. nerve growth fector. and activation of protdn k^ 
10 i«5reaseceDularlevdsofSPPbyactivationofsphiAgosu^ 

inMbitors of the enzyme selectively inhibit cdl proliferation induced by PDGF (Kohama et al. 
supre[). 

Purine Nude ntide Khiasi^s 

The purine nucleotide kinases, adenylate kinase (ATP-.AMP phosphotransferase, or AdK) and 
15 fiWrlatekinase(ATP:GMPifliosph0transfenise,orGuK)0 

andareciucialtoaiesynfliesisandregnhitionof cellularlevc&of ATPandGTO^ respectively. These 
two molecules are i«ecuBon; in DNA and RNA synthesis in growii^cete 

««>«»rfbiochHnicaleneigy in ceBs (ATP), and signal tnmsd^ Inhfliitionof 
various steps in tiie syntiiesis of these two molecules has been tiie basis of many antiproliferative 

20 drugs for cancer and antiviral tiierapy (Pahvem. K. et al. (1990) Cancer Res. 50:1576-1579). 

AdK is found in ahnost an ccffl types and is espedalty abundant in cells havii« hi^ 
ATP synthesis and utilization such as skdetalmnsde. IntheseceMs AdK is i*ydcally associated 
wifli mitochondria and myofibrils, die subceDuIar structures ,tiiat are involved in energy production 
and utilization, respectively. Recent studies have demonstrated a major fonction for AdK m 

25 transferring Mgji energy phosphoiylsfrommetabcflic processes geneniting ATP to cdttular 

components consuming ATP (Zeleznikar. RJ. et aL (1995) J. BioL Chem. 270:7311-7319). Thus 
AdK may have a pivotal role in maintaining enogy production in ceDs. particularly ti»se having a 
high rate of growth or metabolism such as cancer cdls , and may provide a target for suppression of its 
. .. activity in order to^^t certain cancers. Altetnativdyrieduced AdK-activity-maybe-ff-seurcedf 

30 variousmetaboUc,muscle-energydisorderslhatcanresultincardiacorrespiratoryfi^ 
treatable by increasing AdK activity. 

GuK, in addition to providing a key step in ihe synfliesis of GTP for RNA and DNA 
synthesis, also folfills an essential function in signal tiansduction patiiways of ceDs through the 
regulation of GDP and GTP. Specificalty, GTP bindmg to membrane associated G proteins mediates 
35 the activation of cen receptors, subsequent intraceUular activation of adenyl cyclase, and production 



10 



FP.1722P 

of the second messenger, cyclic AMP, GDP bindiiig to G proteans iDhibits Oiese process GDP and 
GTP levels also control ttie activity of obtain oncogenic protdns such as p21"' known to be involved 
in control of cell proliferation and oncogenesis (Bos, J,L. (1989) Cancer Res. 49:4682-4689). High 
raticw of GTPiGDP caused by suppression of GuK cause activation of p21"' and promote 
5 oncogenesis. Increasing GuK activity to increase levels of GDP and reduce the GTP:GDP ratio may 
provide a therapeutic strategy to reverse oncogenesis. 

GuK is an in^ortant CTzyme in the phosphorylation and activation of certain antiviral drugs 
useful in the treatmrait of herpes virus hifections. These drugs include the guanine homologs 
acyclovir and buciclovir (Mill«-, W.H. and R.L. Miller (1980) J. Biol. Chem. 255:7204-7207; 

10 Stenberg, K. et al, (1986) J. Biol. Chem. 261:2134-2139). Increasing GuK activity in infected cells 
may provide a therapeutic strategy for augmenting die effectiveness of these drugs and possibly for 
reducing the necessary dosages of the drugs. 
Pvrlmldine Klnastis 

The pyrinndine kinases are deoxycytidine kinase and thymidine kinase 1 and 2. 

15 Deoxycytidine kinase is located in the nucleus, and thynridine kinase 1 and 2 are found in the cytosol 
(Johansson. M. et aL (1997) Proc. NatL Acad. Sci. USA 94:1 1941-1 1945). Phosphorylation of 
deoxyiibonucleosides by pyrinmdme kinases provides an alternative pathway for de novo synthesis of 
DNA precursors. The role of pyrinmdine kinases,- like purine kinases, in phos 
the activation of several dbemotherapeutically ingiortant nucleoside analogues (Amer E.S. and S. 

20 Eriksson (1995) FhannacoL Ther. 67:155-186). 

PHOSPHATASKS 

Protem phosphatases are generally characterized as dflier s^ine/iOireonuie- or tyrosme- 
specific based on flidr preferred phospho-annno acid substrate. However, some phosphatases (DSPs, 

25 for dual spedfidty phosphatases) can act on phosphorylated tyrosim, serine, or threonine residues. 
The protdn smneAhreonine pho^]hatases (PSPs) are important regulators of many cAMP-mediated 
hormone responses in cells. Protrin tyrosine phosphatases (PTPs) itoy a significant role in cett cycl^ 
and cell signaling processes. Anoth^ family of phosphatases is the acid phosphatase orhistidine acid 
phosphatase (HAP) family whose membm hydrolyze jdbosplmte esteis at'a'Saic pH~^^ 

30 PSPs are found in the cytosol, nucleus, and nutochondiia and in association with cytoskeletal 

and membranous structures in most tissues, especially die brain. Some PSPs require divalent cations, 
such as Ca^* or Mn^*, for activity, PSPs play moportant roles in glycogen metabolism, muscle 
contraction, protem synthesis, T ceU function, neuronal activity, oocyte maturation, and hqmtic 
metabolism (reviewed in Cohen, P. (1989) Annu. Rev. Biochem. 58:453-508). PSPs canbe separated 

35 into two classes. The PPP class includes PPl, PP2A. PP2B/calcineurin, PP4, PP5, PP6. and PP7. 
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Membm of this class aie conoposed of aliomDlogpiis catalytic sabunitbeaniiig a v^bigUy 
conserved signature sequence, coupled wSi one or jnore regidatoiy subunits (PROSITB 
PDOCXM)! IS). FUrOier intmcdons vnOx scaffold and ancboriog molecules d^enmne the intracellular' 
localization of PSPs viad substrate specificily. Tbe PPM class consists of several clos^ related 
5 isoforms of PP2C and is evolutionarily unrdated to ttie PPP class. 

PPl dq;diosphorylatBS naasy of the protdns {diospboiylated by cyclic AMP-depend^ protdnr 
kinase (PKA) and is aninyortantregidatorof nmtny cAMP^nasdiatedhornQone A 
nunober of isofonns bave been idoidlied, vnfh the alpba and beta fonns h&ng produced by alternative 
splidng of the same gene. BottLUbiqpitousandttssue-specifictaigedpgprotdns for PPl have bem 
10 identified. Iafliebrain,iidbibitionof PPl activity by fliedopanme and admosi^ 

monophospbate-regnlated pbospboprotdn of 32kDa CDARPP-32) is necessary fior nom^ dopamine 
response in neostriatal neurons (reviewed in Price, N.E. and M.C. Muniby (1999) Cutr. Opin. 
NeurobloL 9:336-342). PPl, aloc^ with PP2A* has been shown to liimt motility in ndcrovascular 
CTdothelial c^, suggesting a role for PSPs in die inhibition of angiogenesis (Gabel, S. et aL (1999) 
is Otolaryngol. Head Neck Siirg.121 :463-468). 

PP2A is the main serine/threonine phosphatase. The core PP2A enzyme consists of a single 
36 kDa catalytic subunit (C) associated widi a 65 kDa scaffold subnnit (A), whose role is to recruit 
additional regulatory subunits (B). Three gene families encoding B subunits are known (PR55, PR61, 
and PR72X each of which contain multiple isofonns, and additional families may exist (Millward, 
20 T.A et al. (1999) Trends BioscL 24:186-191). These "B-type*' subunits are cell type- and tissue- 
specific and determine the substrate specificity, enzymatic activity, and subcellular localization of the 
holoeozyme. The PR55 family is highly conserved and bears a conserved motif (PROSITE 
PDOC00783). PR55 increases PP2A activity toward nntogen-activated protein kinase (MAPK) and 
MAPK kinase OMEK). PP2A dephosphorylates the MAPK active site, inhibiting the cell's entry into 
25 mitosis. Several proteins can c0nq)ete v^th PR55 for PP2A core enzyme binding, including the CKH 
Idnase catalytic subunit, polyomaviras middle and small T antigens, and SV40 small t antigra. 
Virases may use this mechanism to comnoandeer PP2 A and stimulate progression of die cell throng 
the cell cycle (Pallas, D.C. et al. (1992) J. Virol. 66:886-893). Altered MAP kinase expression is also 
implicated in a variety of disease conditions -including cancerrinflamgnationT immune disofd^s, and 
30 ' disordm affectii^ growtii and developmenL PP2 A, in £&ct, can dq>hosphorylate and modulate die 
activities of more tiian 30 protdn kinases in vitrot and other evidence suggests that die same is true in 
vivo for such kinases as PKB, PKC, the calmodulin-depradent kinases, ERK family MAP kinases, 
qrdinrdepmdent kinases, and the IkB kinases (reviewed in Millward et al., suprd). PP2A is itself a 
substrate for CKI and CKD kinases, and can be stimulated by polycationic macromolecnles. A PP2 A- 
35 like phosphatase is necessary to tyiglfitftin die Gl phase destraction of mammalian cyclins A and B 
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(Bastians. H. et aL (1999) Mot Biol. CeH 10:3927-3941). PP2A is a major activity in the brain and is 
inqilicated in regulating nemofflament stability and normal neural fiuiction, particular^ flie 
phosphoiylation of the nncsnitubule-assocdated protdn tan. Hyperphosphorylation of tau has hem, 
proposed to lead to Qie neuronal degexteration seen in Alzhehn^'s disease (reviewed in Pace and 
S Muidby , suprd), 

PP2B, or calcinenfin, is a Ca^-activated dim^c phosphatase and is particidaily abnndant in 
Oiebrain. It consists of catalytic and regidatorysubumts, and is activated by flie bind 
calciuni/'calniodulin con^loK. Calcineorin is flie target of flie immunosuppressant drugs cyclosporine 
and FKS06. Along with other cdlidar factors, fliese drags interact with caldneorin and inhibit 
10 phosphatase activity. In T cells, this blocks the calcium dq)eDdent activation of tiie NF-AT family of 
transcription foctors, leading to immniiosnppression. lliis famQ 

that csdcineuiin regnlatBS gene expression in otiij^ tissues as well. In neurons, calcineurin modulates 
functions which range from the inhibition of neurotransnHtter release to desensitization of 
postsynaptic NMDA*receptor coupled calcium channels to long term memory (reviewed in Price and 
15 Mumby, supra). 

Other members of the PPP class iave recently been identified (Cohen, P.T. (1997) Trends 
Biochem. Sci. 22:245-251). One of them, PP5, contains regulatory domains with tetratricopeptide 
repeats. It can be activated by polyunsaturated fatty acids and anionic phospholipids in vitro and 
appears to be involved in a number of signaling pathways, including those controlled by atrial 
20 natiiuretic peptide or steroid hormones (reviewed in Andreeva, AV. and M. A Kumzov (1999) CeD 
Signal. 11:555-562). 

PP2C is a -42kDa monomer with broad substrate specificity and is dependent on divalent 
cations (mainly Mtf* or Mg^^ for its activity. PP2C protdns share a conserved N-temsnal region 
with an invariant DGH motif, wbich contains an aspartate residue involved in cation binding 

25 (PROSITE PDOC00792 j. Targeting protdns and mecbamsms regulating PP2C activity have not 
been identified PP2C has bera shown to inhibit die stress-responsive p38 and Jon kinase (JNIQ 
patiiways (Takekawa, M. et aL (1998) EMBO J. 17:4744-4752). 

In contrast to PSPs, tyrosine-specific phosphatases (PTPs) are generally monommc proteins 
of v^ c£verse size (from2QkDa to greater dian 100kDa>and.structare tiiat fimctionpriitfiuily'in the " 

30 transduction of signals across the plasma menibrane. PTPs are categorized as eitii^ soluble 

phosphatases or transmecdbrane receptor protdns that contain a phosphatase domain. AH PTPs share 
a conserved catalytic domain of about 300 anuno acids vMdb contains the active site. The active site 
consensus sequence includes a cystine reddue which executes a nucleojdiilic attack on the phosphate 
moiety duribr^ catalysis (Neel, B.G. and N.K. Tonks (1997) Curr. Qpin. Ck^ll BioL 9:193-204) . 

35 Receptor PTPs are made np of an N-temmnal extracellular domain of variable lengdi, a 
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• transnienibrane tegion, and a cytoplasimc region that generally contains two copies of die catalytic 
domain. Altiboiig^ o2dy the first copy se^ns to have enzymatic 

aCbcts the substrate specificity of &e first The extracelliflar donmins of some rccqitor PTPs cont^ 
fibronectin^like lepeats, immunoglobiilin-like domains, MAM domains (an extracdhdar motif lilody 
5 to have an adhesive function), or carbonic anbydrase-like dom^ns CPROSITE PDOC 00323). llus 
wide variety of stroctond motifs accounts for die di v^sity in size and specifidty of PTPs . ^ 

PTPs play in^ortant roles in biological process^ sucdi as cell adhesion, lynphocyte 
acdvation, and cell prolifmtion. PTPs ft and k are involved in cell-cdl contacts, pediaps regulating 
cadhmn/catemn function. A number of PTPs affect cell spreading, focal adhesions, and cell motility, 
10 most of them via the integrin/Qrrosine kinase signaling pathway (reviewed in Ned and Tohks, suprdi. 
CD4S phosphatases regulate dgnal transduction and lynq^ihocyte activation (Ledbetter, J.A. et al. 
(1988) Pioc. Nafl.- Acad ScL USA 85:8628-8632). Soluble PTPs containing Src-homDlogy-2 
domains have bem id^otified (SHPs), suggesting that these molecules nngbt interact with receptor 
tyrosine kinases. SHP-1 regulates cytokine receptor signaling by controlling the Janus fannly PTKs 
15 in h^natopoietic cells, as well as signaliog by the T*ce31 rec^tor and c-Kit (reviewed in Neel and 
Tonks, supra). M-phase inducer phosphatase plays a key role in the induction of ncdtosis by 
dq)hosphorylating and activating the PTK CDC2, leading to ceQ division (Sadhu, K. et al. (1990) 
Proc. Nafl. Acad. Sci. USA 87:5139-5143). In addition, the genes encoding at least dght PTPs have 
been maj^ed to chromosomal regions that are translocated or rearranged in various neoplastic 
20 conditions, including lynq>homa, small cell lung carcinoma, leukemia, adenocarcinoma, and 

neuroblastoma (reviewed in Charbonneau, H. and N.K. Tonks (1992) Annu. Rev. Cdl Biol. 8:463- 
493). The FTP enzyme active site con^rises the consensus sequence of the MTMl gene fancnly. The 
MTMl gene is responsible for X-linked recessive nQrotubular myopathy, a congenital muscle* 
disorder that has been linked to Xq28 (Kioschis, P. et al., (1998) Genontucs 54:256-266). Maay PTKs 
25 are encoded by oncogenes, and it is well Imown that oncogeneds is oftmaccon^ 

tyrosine phosphorylation activity. It is therefore possible that PTPs may s^e to prevent or reverse 
cell transformation and the growtii of various cancm by controllixig tiie levels of tyrosine 
phosphorylation in cdls. This is supported by stupes showing that overexpresdon of FTP can 
suppress transfbrmation in ceDs and that-specific-inhibition of PTP can'€Qblinc6l:ell~tnu^ 
30 (Charboxmeau and Tonks, siqfrd). 

Dual specificity phosphatases (DSPs) ate structurally more sinmlar to tiie PTPs than the PSPs. 
DSPs bear an extended PTP active site motif with an additional 7 amino acid residues. DSPs are 
primarily assodated vnOi cell proliferation and include the cell c^cle regulators cdc25A, B, and C. 
The phosphatases DUSPl and DUSP2 inactivate the MAPK fanuly wmbm ERK (extracellalar 
35 signal-regulated kinase), JNK (c-Jun N-temunal kinase), and p38 on both tyrosine and tiueordne 
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residues (PROSITE PDCX^ 00323, supra). In tbe activated state, tbese Idnases have bem iiqdicated 
in ofimonal differeatiatioii, piolifoatian, oncogMc transformation, platelet aggregation, and 
apoptosis. Thus, DSPs are necessary for prop^ regulation of these processes (Muda, M. et al. (1996) 
J. BioL Choa 271 :2720S-27208). The tumor suppressor PTEN is a DSP Uiat also shov^ lipid 
S i&osphatase activity. It seraais to negatively r^qlateint^actionsi^flittieextracdlidarmati^ 

noaintains sensitivity to apoptosis. PTEN has be» implicated in the prevention of angiogenesis(Giri, 
D. and M. Ittmann (1999) Hum PalhoL 30:419-424) and abnormalities in its expression are 
associated nnmenms cancears (reviewed in Tansura, M. et al. (1999) J. NafL Canc^ Inst 
91:1820-1828). 

10 Histidine acid phosphatase (HAP; EXPASY EC 3.1.3.2), also known as add phosphatase, 

hydiotyzes a wide spectnunof substrates includiog aO^l, aryl, and acyl orthophospbate monoesters 
and phosphorylated proteins at low pH. HAPs share two legjions of conserved sequences, each 
centered around a histidine residue which is involved in catalytic activity. Mralbers of the HAP 
&mily include lysosomal acid phosphatase (LAP) and prostatic acid phosphatase (PAP), both 

15 s^itive to inhibition by L-tartrate (PROSITE PDOC00538). 

Synaptojanin, a polyphosphoinositide phosphatase, dephosphorylates phosphoinositides at 
positions 3, 4 and 5 of die inositol ring. Synaptojanin is a niajor presynaptic protein found at clathrin- 
coated endocytic intermediates in nerve terminals, and binds the clatimn coat-associated protein, 
EPS 15. This binding is mediated by the C-terminal region of synaptojanin-170, which has 3 Asp-Pro- 

20 Phe annuo acid repeats. Further, this 3 residue repeat had been found to be the binding site for the 
EH domains of EPS15 (Haffioer, C. et al. (1997) FEBS Lett 419:175-180). AdditionaUy, 
synaptojanin may pot^3tially regulate interactions of endocytic proteins with the plasma membrane, 
and be involved ia synaptic vesicle recycling (Brodin, L. et al. (2000) Curr. Opin. Neurobiol. 10:312- 
320). Studies in mice with a targeted disruption in the synaptojanin 1 gme (Synjl) w^e shown to 

25 support coat formation of mdoc^tic vesicles more dUsctiveOly dian was seen in wild-typo nuce, 

suggesting that Synjl can act as a negative regulator of membrane-coat pjotein intmctions. These 
findings provide genetic evidence for a crucial role of phosphoinositide metabolism in synaptic 
vesicle reqrcling (Crenxina, 0« et al. (1999) Cdl 99:179-188). 

FhrpuBfiginn pmftlinpr . ^ ... - rr — . 

30 Microarraysaie analytical tools used in bioanalysis. A microarray has a plurality of 

noolecules spatially distributed over, and stably assodated with, die surfSeice of a solid support 
Microarrays of polypeptides, polynndeotides, and/or antibodies have been devdoped and find use in 
a variety of applications, sudh as gene sequendng, monitoring gene expression, gene mapping, 
bacterial idmtification, drug discov^, and combinatorial chemstry* 

35 One area in particular in which microarrays find use is in gene expression analysis. Array 
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tedmology can provide a siII^>le way to explore the expression of a single polymojcpbic gene or the 

expiessionproffleof alargenimiberofidatedorunrdatedg^ies. Wh» the expression of a single 

gene is exaxnuied, arrays are employed to detect the expression of a specific gene or its variants. 

Vnxeoi an espres^on profile is exanuned, arrays provide si platform for idaittf^ng genes that are 
5 tissoe specific, aie affected by a substance being tested in a toxicology assay, are part of a sigoaliQg 

cascade, carry out h0nsek6q>i^g functions, or are spedficaBy related to a particular gen^c 

predisposition, condition, disease, or disorder. 

Tbum is a need in tihe art for new con^sidons, including nuclric adds and protons, for the 

diagnosis, prevention, and treatno^ of cardiovascular diseases, iimmne syst^i disordm, 
10 neurological disorders, disorders afiecting growdi and developing lipid disordears, cell proliferative 

disorders, and cancm. 

SUMl^RY OF THE INVENTION 
Various enibodfinMSDts of the invention provide purified polypq>ddes, Idnases and 
15 phosphatases, referred to collective^ as *KPP* and individually as and nosOiods for using 

these protons and their encoding polynucleotides for flie detection, diagnosis, and treatment of 
leases and ludical conditions. Einboditnmts also provide methods for utilr^ng the purified 
Idnases and phosphatases and/or their encoding polynucleotides for facilitating the drug discovery 
process, including determination of efBcacy, dosage, toxicity, and pharmacology. Related 
20 embodiments provide me^ds for utilizing the purified kinases and phosphatases and/or their 
encoding polynucleotides for investigating the pathogenesis of diseases and medical conditions. 

An ennbodiment provides an isolated polypeptide selected from the group consisting of a) a • 
polypeptide conqirising an amino acid sequence selected from the group consisting of SEQ ID NO:l, 
b) a polypq>tide comprising a naturally occurring axxdno acid sequence at least 90% id^itical or at 
25 least about 90% identical to an annuo acid sequence selected from the group consisting of SEQ ID 
NO:l, c) a biologically active fragment of a polypeptide having an anund acid sequence selected fixim 
the group consisting of SEQ ID NO: 1 , and d) an immunogenic fragment of a polypeptide ha^rang an 
amino acid sequence selected from the group consisting of SEQ ID NO:l, Another enibodim^t 
provides an isolated polypeptide conqirising anamino acid-sequence of SEQ ID NO:!. 
30 Still another enibodiment provides an isolated polynucleotide encoding a polypeptide 

selected from the group consisting of a) a polypeptide comprising an anuno acid sequence sdectied 
from the group consisting of SEQ ID NO:l , V) a polypeptide comprising a naturally occurring amino 
add sequence at least 90% id^cal or at least about 90% idmtical to an anmo acid sequence 
selected from the group consistiog of SEQ ID NO:l, c) a biolo^cally active fitigmoit of a 
35 polypeptide having an anuno add sequence selected from the group consisting of SEQ ID NO:l, and 
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d) an imanuiogeiiic fragooBixt of a polypeptide haviag an amino acid sequence sdected firom fhe group 
CQodstiiigof SEQIDNO:!. In another 6a3bodimBiit»aepolyiindeotid6eac^ 
selected fromthe group comistiiig of SEQ ID NO:l. In an alternative embodimeot, the 
polynucleotide is sdected from Oie group cousisting of SEQ ID NO:2. 

S Stin anothm^ eoabodim^ provides a recombinant polynucleotide con^prisii]^ a promote 

sequmce opmbty linked to a polynucleotide encoding a pol^ypeptide selected from Oie group 
conasting of a) a polypeptide conqnising ananmno add sequmoe selected fromttie group consisting 
of SEQ ID IfO:l, b) a polypeptide compiisuig a natural^ occurring amino add sequmce at least 90% 
identical or at least about 90% identical to an an&no add sequence selected from the group consisting 

10 of SEQ ID NO:l, c) a biologically active fragment of a polyp^tide having an annuo add sequence 
selected fromflie group consisting of SEQ ID NO:l, and d) animmunog^c firagmmt of a 
pol^eptide having an annno add sequmce sdected from the group consisting of SEQ ID NO:l. 
Another ^xibodiment provides a cell transfbrmed vfifh Hie recombinant polynucleotide. Yet another 
embodiment provides a transgenic organism conqprising fhe recombinant polynucleotide. 

15 Anodu^ embodimmt provides a meOiod for producing a po^rpeptide sdected fiFomtlie group 

consisting of a) a polypeptide conqxrising an amino add sequence selected from die group consisting 
of SEQ ID NO:l, b) a polypeptic^ conqiiising a naturally occurring amino acid sequence at least 90% 
idmtical or at least about 90% identical to an amino acid sequence selected from the group consisting 
of SEQ ID NO:l , c) a biologically active fragment of a polypeptide having an annno acid sequence 

20 sdected from the group consisting of SEQ ED NO:l » and d) an immunogenic fragment of a 

polypeptide having an armno acid sequence selected from the group consisting of SEQ ED NO:l. Ihe 
method conqprises a) culturing a cell under conditions suitable for expression of the polyp^tide, 
wherdn said cell is transformed with a recombinant polynucleotide comprising a promoter sequence 
operably linked to a polynudeotide encoding the polypeptide, andb) recovering the polypeptide so 

25 expressed. 

Yet another enoibodiment provides an isolated antibody which specifically binds to a 
polypeptide selected from the group consisting of a) a polypeptide conq)risiilg an anino add 
sequ«ce selected from the group consisting of SEQ ID NO:l> b) a polypeptide comprising a naturally 
occurring armno acid sequence at least 90% idratical-or arleasral>dut'90%~u!efiticdlb an ansino acid 

30 sequence selected from the group consisting of SEQ ID NO:l , c) a biologically active fragmrait of a 
polypeptide having an annno acid sequence selected fromflbte group consisting of SEQ ID NO:l, and 
d) an immunog^c fragnxait of a polypeptide having an anuno acid sequence sdected from the group 
consisting of SEQ ID NO:l. 

Stin yet another «ibodim»t provides an isolated polynucleotide selected from the group 

35 consisting of a) a polynudeotide conqprising a polynucleotide sequence selected from tiie group 
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coDsisling of SEQ ID N02, b) a polymcleotide conqmsipg a naturally occaxring polynucleotide 
sequence at least 90% identical or at least about 90% identical to a polynucleotide sequence sdected 
from the group consistii^ of SEQ ID NO:2, c) a polynucleotide conqplesn^itaiy to die polynucleotide 
of a), d) a polynucleotide conq>len)Bntaxy to the potynucleotide of b), and e) an RNA equivalent of a)- 
5 d). In other embodiments, die polynucleotide can comprise at least about 20. 30, 40, «), 80, or 100 
contiguous nucleotides. 

Yet another OTJbodiment provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide being selected fixim the group consisting of a) a polynucleotide conq>rising 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:2, b) a polynucleotide 

10 con5)rising a naturally occuning polynucleotide sequence at least 90% identical or at least about 90% 
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:2, c) a 
polynucleotide con5>lementary to the polynucleotide of a), d) a polynucleotide conq)l«nBntaiy to die 
polynucleotide of b), and e) an RNA equivalent of a)-d). The method con5>iises a) hybridiang the 
sample with a probe comprising at least 20 contiguous nucleotides comprising a sequrace 

15 conqdementary to said target polynucleotide in the sample, and which probe specifically hybridizes to 
said target polynucleotide, under conditions whereby a hybridization con[^Iex is fom^d betwem said 
probe and said target polynucleotide or fragments thereof, and b) detectmg die presence or absence of 
said hybridization conqslex. In a related embodiment, the mediod can include detectmg the amount of 
the hybridization conylex. In still oth^ endbodimmts, the probe can comprise at least about 20, 30, 

20 40, 60, 80, or 100 contiguous nucleotides. 

Still yet another enobodimirait provides a mi^od for detecting a target polynucleotide in a 
sample, said target polynucleotide bong selected from the group consisting of a) a polynucleotide 
comprisiij^ a polynucleotide sequence selected from the group consisting of SEQ IDNO:2, b) a 
polynucleotide oonq>iishig a naturally ocburripg polynucleotide sequence at least 90% identical or at 

25 least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:2, c) a polynucleotide conqplementaiy to the polynucleotide of a), d) a polynucleotide 
conqplemootaiy to the polynucleotide of b), and e) an RNA equivalent of a>d). The n^thod 
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase cham 
reaction anq>lification, and b) detecting'the presmce or absehcebf sdid^^lifiea t^et '^^ ' 

30 polynucleotide or fragment thereof. In a rdated embodiment, die mediod can include detecting die 
amount of the amplified taxget polynucleotide or fragment thereof. 

Anoth^ eDGbodiment provides a composition conqpiising an effective amount of a 
polypeptide selected from the group consisting of a) a polypeptide conqirising an atnino acid 
sequence selected from die group consistii^ of SEQ ID NO:l , b) a polypqitide comprising a naturally 

35 occuning amino acid sequence at least 90% id^itical or at least about 90% identical to an amino acid 
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sequence selected from the group ccxnsistipg of SEQ ID NOrl, c) a biologically active firagmrat of a 
polypeptide having an axmno acid sequoce selected firomtfie group consisliag of SEQ ID NO:l« and 
d) an inntamo^uc firagmsnt of a polypeptide haviiig an anonno acid sequence sdected fiomthe group 
consisting of SEQ ID NO:l, and a phaimaceuticany acceptahle excipient In one embodiment, flie 

5 composition can conqnrise an amino acid sequrace selected £nom die group consisting of S EQ ID 
N0:1. Odier CTbodiments pro^dde a method of treating a disease or condition associated vnftx 
decreased or abnormal expression of funcdonal KPP, compiising adnmustering to a patient in need of 
such treatment the conqposition. 

Afio flter embodiment provides a method for soreening a compound for efEecti v^iess as an 

10 agonist of a polypeptide sdected firomflie group consistii^ of a) a polypeptide con^risii^ an amino 
acid sequCTce sdected firom die group consisting of SEQ ID NO:l, b) a polypeptide comprising a 
naturally occurring anmno acid sequrace at least 90% identical or at least about 90% idratical to an 
anmo add sequooce selected from die group consisting of SEQ ID NO:l, c) a biologically active 
fragment of a potypqptide having an amino add sequence selected from the group consistixig of SEQ 

15 ID NO:l, and d) an immunogenic fragment of a polypqitide having an amino acid sequence sdected 
from the group consisting of SEQ ID NO:l. The method comprises a) contacting a sanqde 
conq)rising the polypeptide widi a conqpound, and b) detecting agonist activity in the sample. 
Anoth^ embodimrait provides a conqKisition comprising an agonist compoimd identified by the 
method and a pharmaceutically acceptable excipient Yet another endbodiment provides a method of 

20 tieating a disease or condition associated with decreased expression of functional KPP, comprising 
administering to a patient in need of such treatment the conoposition. 

Still yet aiu>ther embodiment provides a method for screening a compound for effectiveness 
as an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO:l, b) a polypeptide 

25 conq>rising a naturally occurring amino acid sequence at least 90% identical or at least about 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID NO:l , c) a 
biologically active fragment of a polypeptide having an annno acid sequrace selected from die group 
consisting of SEQ ID NO:l> and d) an immunogenic fragment of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1. Tfie fediod con5>rises a) contacting a 

30 sample comprising the polypeptide ^th a conipound, and b) detecting antagonist activity in the 

sanople. Another enibodim^ provides a con;)Osition concprising an antagonist conqpound identified 
by die m&thod and a pharmaceutically acceptable excipient Yet another enibodiment provides a 
method of treating a disease or condition associated with overexpression of functional KPP, 
con^rising adnnnistering to a patient in need of such treatment the conqvosition* 

35 Another embodimi^t provides a method of screening for a conq>ound diat specifically binds 
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to a polypeptide sdected fiomfhe groop coo^tiiig of a) a polypeptide conqaisiiig an aBaino acid 
sequ»ce selected from Oie group consistiogof SBQ ID NO:l,b) a polypeptide coiqprisiiig a natorally 
occuniD^ ainino acid sequeoce at least 90% id»ttcal or at least about 90% idratical to an ammo add 
sequence selected from fire gnmp counting of SEQ ID NO;l, c) a biologically active fragmeot of a 

5 polyp^tide having an annuo acid sequence selected from the group conasting of SEQ ID NO;l, and 
d) an immunogenic firagment of a polyp^tide having an amino acid sequence sdected from flie group 
consistiqg of SEQ ID NO:l. The mediod conqirises a) combining the polypeptide with at least one 
t^ conqpound undm' suitable conditions, and b) detecting binding of tfie polypeptide to the test 
con^und. thereby idenlif^g a conqiound that specifically binds to the polypeptide. 

10 Yet another embodiment provides a melhod of scieening for a compound that modulates the 

activity of a polypqitide selected from the group consisting of a) a polypeptide conprisiiig an amino 
acid sequence selected IGromthe group consisting of SEQ ID NO:l, b) a polypeptide conqprisioe a 
naturally occoning amino add sequence at least 90% identical or at least about 90% idmtical to an 
amino add sequence selected from the group con^stii^ of SEQ ID NO:l, c) a biologtcaOy active 

15 ftagment of a polypeptide having an andno add sequence selected from the group consisting of SEQ 
ID NO:l, and d) an immunogenic fragment of a polypeptide having an annno acid sequence selected 
from the group consisting of SEQ ED NO:l . The method comprises a) combining tiie polypeptide 
with at least one test compound mder conditions permissive for the activity of the polypeptide, b) 
assessing the activity of the polypeptide in the presence of the test compound, and c) comparing the 

20 activity of the polypeptide in tiie presence of the test compoimd with the activity of the polypeptide in 
' the absence of the test compound, wherein a change in tiie activity of the polypeptide in the presence 
of the test compotmd is indicative of a conq^oxmd that modulates the activity of the polypeptide. 

Still yet another embodiment provides a method for screening a compound for effectiveness 
in altering expression of a target polynucleotide, wherein said target polynucleotide coinprises a 

25 polynucleotide sequence selected from the group consisting of SEQ ID NO:2, the method con^rising 

a) contacting a sanq)le coirq)rising the target polynucleotide with a compound, b) detecting altered 
expression of the target polynucleotide, and c) comparing the expression of the target polynucleotide 
in the presence of varying amounts of the confound and in the absence of the conqiound. 

Another embodiment provides a-method for assessing tbxid^ of a ~t£t concy^ound, said 
30 method con9>rising a) treating a biological sample containing nucldc adds with the test conqpound; 

b) hybridizing the nucleic adds of the treated biolo^cal saniple with a probe con^rising at least 20 
contiguous nucleotides of a polynudeotide selected firomthe group consisting of i) a polynucleotide 
conqirising a polynucleotide sequence sdected firomthe group consisting of SEQ ID NO:2, ii) a 
polynucleotide con^irising a naturally occuniiig polynucleotide sequence at least 90% identical or at 

3S least about 90% idmtical to a polynucleotide sequence selected firom the group consistiDg of SEQ ID 
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NO:2, iii) a polynucleotide ha\dng a sequence conq>le]X]eiitary to i), iv) a polynucleotide 
conqilenientafy to the polynucleotide of ii), and v) an RN A equivalent of i>iv). Hybridization occurs 
under conditions whereby a specific hybridization conqplex is formed between said probe and a target 
polynucleotide in the biological sample, said taiget polynucleotide selected from &e gronp consisting 

5 of i) a polynucleotide conq>rising a polynucleotide sequence selected from the gronp consisting of 
SEQ ID NO:2, ii) a polynucleotide comprising a naturally occuning polynucleotide sequence at least 
90% identical or at least about 90% identical to a polynucleotide sequence selected firom the gionp 
consisting of SEQ ID NO:2, iii) a polynucleotide conq>lementaiy to the polynucleotide of i), iv) a 
polynucleotide complementary to die polynucleotide of ii), and v) an RNA equivalent of iH^). 

10 Alternatively, die target polynucleotide can comprise a fragment of a polynucleotide selected firom 
the gronp consisting of i)-v) above; c) quantifying (he amount of hybridization complex; and d) 
comparing the <^twftntit of hybridizafion conqplex in the treated biological san^le wifli the amount' of 
hybridization conqdex in an untreated biological sample, wherein a difference in the amount of 
hybridization complex in the treated biological sample is in^cative of toxicity of the test compound. 

15 

BRIEF DESCRIPTION OF THE TABLES 
Table 1 stmomarizes the namenclature for fun leag/h polynucleotide and polypeptide 
embodiments of the invention. 

Table 2 shows the GenBank identification nunfl>er and annotation of tiie nearest GenBank. 
20 homolog, and the PROTEOMB database idootification numbers and annotations of PROTBOMB 
database homologs, for polypeptide enibodimraits of the inveaitiaa The probability scores for the 
matches between each polypeptide and its homologCs) are also shovnoL 

Table 3 shows structural features of polypeptide endbodiments, including predicted motifs 
anil domains* along with the methods, algorithms, and searchable databases used for analysis of the 
25 polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assCTdble 
polynucleotide ftwihodimftntg, along with selected fragmraits of the polynucleotides. 

Tables diows representative cDNA libraries for polynucleotide embodiments. , 

Table 6 provides an appendix whidx d^cribes the'ti^es.and vectors used for construction of 
30 die cDN A libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze polynucleotides and 
polypeptides, along with applicable descriptions, references, and threshold parameters. 

^ Table 8 shows single nucleotide polymorphisms found in polynucleotide sequences of the 
invention, along witb allele frequencies in different human populations. 

35 
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DESCRIPTION OF THE INVENTION 
Brfore die present proteins, nucleic adds, and meSiDds aie described, it is understood that 
CTEibodkDKits of fhe invrntion are not limited to die pardcular machines, instnimraits, noaterials , and 
n^fhods described, as Aese nmy vary. Itis dso tobeundostoodfhatdi&teramiologyosedhminis 
5 for the purpose of descrihiiig particular eoibodinients only, and is not int»ded to linmt the scope of 
die invention. 

As used herdn and in file appended claims, die singidar forms *^a,** ^^an,** and '^e'* include 
plural refermce unless fhecontiat clearly dictates otb Tims, for exanqile, a refl»mce to ''a 

IiDSt cdl** inchides a plurality of sudi host cells,} and a tdieimc& to "^an antibodjr" is a refinence to one 

10 or more antibodies and equivalents th^eof known to those skilled in die art, and so forth. 

Unless defined odierwise, all technical and scimtific terms used heran have die same 
wieantfig^ as conuDonly undostood by one of ordinary sldll indie art to vMkSx diis invention ImIoo^. 
AKhougjh assy Tnarthineg^ tnatcrialg^ and noeOiods jgifnilar or equivsdCTt to those described heiein can be 
.'used to practice or test the preset invention, the preferred nmchines, materials and methods are now 

15 described. AUpubUcadons xnentionedhCTdnarecitedforfhepuiposeof desciib 

the ceU lines, pnrtocols, reagents and vectors whidi are reported indie publications and vrbidb. might 

in c^npgGtiftn varjofiig enrihn dfTnent& of ^hfe invention- Nothu^ herein iS to be COnStTUed 

as an adnds^on that d^e invention is not entided to antedate sucih disclosure by virtue of prior 
invention. 
20 DBFINmONS 

'*KPP" refers to die amino acid sequences of substantially purified KPP obtained from any 
^ecks, particularly a mammalian species, including bovine, ovine, porcine, moorine, equine, and 
human, and from any source, whether natural, synthetic, senn-synthetic, or recombinant 

The term ^'agonist" refers to a molecule which intensifies or nodmics the biological activity of 
25 KPP. Agonists may include proteins, nucleic acids, carbohydrates, smaU molecules, or any other 
confound or con^osition which modulates the activity of KPP either by directly interacting with 
KPP or by acting on cona|>onents of the biological pathway in which KPP participates. 

An "allelic variant" is an alternative form of the gene encoding KPP. Allelic variants may 
result from at least one mutation in the nucleic acid sequence mdlrayrSult in altered mRNAs or in 
30 polypeptides whose structure or function may or may not be altered. A gme may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes nmy occur alone, or in combination with the others, one or more times 
in a given sequence. 

35 "Altered** nuddc acid sequences flooding KPP include those sequmces wiftx deletions, 
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inserdoos, or substitatioiis of dJiS^ent nncleotides, xesidting in a polypeptide the same as KPP or a 
polypeptide mfh at least one fuiictiooalGliarac^^ Iiidaded widim lliis dejBnitioii aie 

polynKiiphisiDS vMch may or may sot be readi^ detectable using a particular oligosBicleotide probe 

of the polynacleotide eocodiog KPP, and inq;>ioper or imeicpected hybridization to allelic variants, 

1 

5 with a locus other than the nonnalcbromosomalloc^ The* 
encoded piotdn may also be **altered," and may contain deletions, iosertLons, or substitutions of 
Aminn acid residues Tsbich produce a dlrat cbange and result in a functionally equivalmt KPP. 
Deliberate amino acid substitutions may be made on flie basis of one or more sinslarities in polarity, 
charge, solubility, bydiophobicity, hydrophilicity, and/or the anqpbipatfaic nature of die lesidues, as 

10 long as the biological or immnnologjicd activity of KPP is retain^ F6r «anq>le, negatively charged 
amino adds may include aspartic add and ghitaidc add, and positiv^ charged ansno adds may 
include tysine and arginine. Amino adds with uncbarged polar side diainsbaviii^sinnla^ 
hydrophilicity values may include: asparagine and ghitannne; and smne and threonine. Amino acids 
with uncbarged side diains baving sinnlar b^drophilicity values tmy include: leucine, isoleucine, and 

IS valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino add** and "annuo add sequence"' can refer to an oligopeptide, a peptide, a 
polypeptide, or a protein sequence, or a fragment of any of these, and to naturally occurring or 
syothetic molecules. Where "aidno acid sequence" is recited to reSes to a sequence of a naturaUy 
occurriiig protein molecule, "antnno acid sequence'* and like terms are not meant to linnt the amino 

20 add sequence to tbe complete native annuo add sequence associated with the recited protein 
molecule. 

"Amplification** relates to the production of additional copies of a nucleic acid. 
Anq)lification may be carried out using polymerase chain reaction (PCK) technolo^es or other 
nucleic add anqplification technologies well known in the art. 

25 Ibe torm "antagonist** refers to a molecule whidi inhibits or attenuates the biological acti^ty 
of KPP. Antagonists may indude proteins sucb as antibodies, anticalins, nuddc adds, 
' carbohydrates, small noolecules, or assy other conqpound or conoposition whidi modulates the activity 
of KPP ddier by directly interacting with KPP or by acting on conqionents of die biological pathway 
in wbicb KPP participates. - "* - - 

30 The term "antibody** refers to intact iimnunoglobulin molecules as wdl as to fragments 

thereof, sudi as Fab, F(ab*)2, and Fv fragmeats, wbidi are capable of binding an qpitopic detemdnanL 
Antibodies that bind KPP polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the imnounizing antigen. Hie polypeptide or oligopeptide 
used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of 

35 RN A, or synthesized dienmcdly, and can be conjugated to a carrier protem Commonly 
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used canieis that are diendcally coupled to p^tides inchide bovine sorum albumin, fliyroglobulin, 
andke^olelixDpethenvK:yaiiin(KLH). The coupled peptide is then used to inmrnnize the amznal. 

T><> tparo "at^genic Ae &ermnanr * refers to fliat region of a molecule (Le., an efUtope) that 

jnakes contact \^th a paiticolar antibody. When a protein or a fira g m en t of a protdn is used to 
S immunize a host animal, numeious regions of flie protein may induce the production of antibodies 
which bind specificaOy to antigenic deteammnants (particular regions or thiee^dimraisionsd structures 
ontheprotdn). An antigouc determinapt may conyete wifli the intact antigen 0.e., the immnnogen 
used to elicit die immune response) for biodiie to an antibody. 

The term ''aptamer'* rders to a nucleic add or oligonucleotide noolecule that binds to a 
10 specific molecular target Aptamers are derived from an in vitro evolutionary ]tfocess (cg^ SBLEX 
(Systematic Evolution of Ijgands by Exponential Enrichment), described in U.S. Patrait No. 
S»270,163X which selects for target-specific aptamer sequences firomlarge conibinatorial libraries. 
Aptamsr con^ositions imy be double-stranded or diigle-stranded, and may inclade 
deoxyribonucleotides, ribonudeoti^, nucleotide derivatives, or ofh^ nucleotide-like molecules. 
IS The micleotide conqionaits of an aptamer may have modified sugar groups (e.g., the 2'-OH group of a 
ribonucleotide may be replaced by 2'-F or 2 -NH2), wludi may inqprove a desired proper^, e.g., 
resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules, 
e.g,, a higji molecular weight carrier to slow clearance of the aptamer from the circulatory system. 
Aptamers maybe specifically cross-linked to their cognate ligands, e.g., by photo-activadon of a 
20 cross-hnker (Brody, E.N. and L. Gold (2000) J. Bioteclinol. 74:5-13). 

The term "intramer^* refers to an aptamer which is expressed in vivo. For exan^le, a vaccinia 
virus-based RNA expression system has been used to express specific RNA aptamers at high levels in 
flie cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Natl. Acad. Sci. USA 96:3606-3610)- 
The term "spiegelm^" refers to an aptamer wluch includes L-DN A, L-RNA, or other left- 
25 handed nucleotide dwivatives or nucleotide-like molecules. Aptamers containing left-handed 
nucleotides are resistant to degradation by naturally occurring enzymes* which normally act on 
substrates containing right-handed nucleotides. 

The term "antisense" refers to any composition capable of base-pairing with flie **smse" 
(coding) strand of a polynucleotide having- a specific nucleic acid sequence* 'AoBsSosi^ cooqpositibns 
30 may include DNA; RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone 
linkages such as phosphorothioates* methylphosphonates, orbenzylphosphonates; oligonucleotides 
having nidified sugar groups such as 2 -metiioxyethyl sugars or 2'-metiioxyedioxy sugars; or 
oligonucleotides having modified bases such as 5-methyl cytosine, 2'-deoxyaracil, or 7-deaza-2 - 
deoxyguanosine. Antisrase molecules may be produced by ariy method including chemical synthesis 
35 or transcription. OnceintroducedintoaceiU, thecoiiq>lencKntary antisrasernoleculebase^p 
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natnially occanix^ middc add sequence produced by die cdl to fonn duplexes wliidi block dther 
transcription or translation. The dedgnation ^'negative" or "nnnos" can le;^ to die antis«se strand, 
and ffae designation ''positive'* or '*idu5** can refo to the sense strand of a reference DNA molecula 

Tbe term l^lologjicaUy active" refers to a protein having structoral, regulatory^ or biochenncal 
5 ftmctioxisof a natoraUy occurring nudecule. Ulcewise/'iimianolo^cally active** or "inaDDDanog»ic^ 
refiers to the capability of Oe natoral, reconibinant, or synthetic KPP, or of any ofigopeptide thmof , 
to induce a spedfic immune response in appropriate animals or ceUs and to bind widi specific 
antibodies. 

"Conaplementary'* des<^bes die rdationship between two single-stranded nncldc add 
10 sequences that anneal by base-pairiqg. For example, 5'-AGT-3' pairs widi its conqdement, 
y-TCA-y, 

A ''conqsosition conqxrising a given polynudeotide** and a "conc^Kisition conqirising a given 
polypqptide*' can refo to any conqiosition containing- the ^ven polynucleotide or polypeptide. The 
conqKisition may con^se a dry formulation or an aqueous solution. Conpositions comprising 

IS polynncleotides encoding KPP or firagmi^ of KPP may be eonployed as hybridization probes. Ihe 
probes may be stored in fieeze-dried form and inay be assodated with a stabil^^ agent sudi as a 
carbohydrate. In hybridizations, the probe may be dqfloyed in an aqueous solution containing salts 
(e.g.. NaCl), detergents (e.g., sodium dodecyl sulfate; SDS\ and other conqponents (e.g., Denhardt's 
solution, dry milk, sahnon sperm DNA, etc.). 

20 '^Consensus sequence*' refers to a nncldc acid sequence which has been subjected to repeated 

DNA sequence analysis to resolve uncalled bases, extended using the XLrPCR kit (Applied 
Biosystems, Foster City CA) in the 5* and/or the 3' direction, and resequenced, or which has been 
assembled from one or nK>re overlapping cDNA, EST, or genomic DNA fragments using a concputer 
program for firagoasnt assembly, such as the GELVIEW fragment assembly system (Accelrys, 

25 Burlington MA) or Phrap (Uiuversity of Washington, S eatfle WA). Some sequences have been both 
extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions tiiat are predicted to least 
interfere with the properties of the original protein, i.e., the stracture and especially the function of 
the protein is conserved and not significantiy changed by such substitutions: The tablebeloW ^bws 

30 anuno acids which may be substituted for an original anuno acid in a protdn and whidi are regarded 

as conservative annuo acid substitutions. 

Original Residue Conservative Substitution 

Ala Gly. Ser 

Axg His, Lys 

35 Asn Asp, Gin, His 

Asp Asn, Glu 

Cys Ala, Ser 
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Gin 


Asn, oiQ» His 


Gla 


Asp, Gin, His 


Gly 


Ala 


His 


Asn, Arg, uln, vilu 


lie 


Lea, Val 




lie, vai 




Arg, vtHi 


Met 


Leu, He 


Phe 


His, Met, Lai, Tip, Tyr 




Cys, Thr 


Thr 


Ser, Val 


Tip 


Phe, Tyr 


Tyr 


His. Phe, Tip 


Val 


n6,Lea,Tbr 
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Oniservative anuno add substitatioiis gai^rally maintain (a) the stiuctore of tbe polypeptide 
backbone in flie aiea of die snbstitution, for example, as a beta sheet or alpha helical conformation, 
(b) the diaige or hydrophobici^ of the molecule at the site of the substitution, and/or (c) the buUc of 
die side chain. 

20 A '^ddetion" refers to a change indie amino acid or nucleotide sequmce that results in the 

absmce of one or more anuno acid residues or nucleotides . 

The tmn ''derivative'* r^ers to a cbmncaUy modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, fin* example, rq>lacCTent of hydrog^ by an 
aBq/l, acyl, hydroxyl, or amino group. A dmvative polynucleotide encodes a polypeptide which 
25 ret£uns at least ojaebiolo^cal or immunological function (xf A derivative . 

polyp^tide is one modified by g^cosylation,pegylation, or any similar process that retains at least 
one biolo^cal or immBonological function of the polypeptide from which it was derived. 

A ''Electable label*' refers to a reports molecule or esayvoR diat is capable of genmtiQg a 
measurable signal and is covalently or noncovalenlly joined to a polynucleotide or polypeptide. 
30 "Diffi^ential expression" refers to increased or upregulated; or decreased, downregulated, or 

absent gene or prot^ expression, determined by comparing at least two diif^nt san^les. Snch 
conq>aiisons maybe carried out between^ for example, a treated and an untreated sarople, or a 

diseased and a normal saniqple. ^ 

"Exon shuffling*' refers to the reconibination of diiSerent coding regions (exons). Since an 
35 exon may represent a structural or functional domain of the encoded protein, new proteins may be 
assembled tiirough the novel reassortment of stable substmctures, thns allowing acceleration of the 
evolution of new protein functions. 

A ^'fragment" is a unique portion of KPP or a polynucleotide encoding KPP which can be 
identical in sequence to, but shorter in length than, the parent sequence. A fragment may comprise up 
40 to the entire length of die defined sequence, minus one nucleotide/amino acid residue. For example, a 
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fiagiDBixt may coinprise firom abom 5 to about 1000 cwtigaous sudeotides or axmuD acid residues. A 
fragment used as a probe, primer, antigeD, duerapeutic molecule, or for oth^ purposes, may be at least 
5, 10, 15, 16, 20» 2S> 30, 40, 50, 60, 75, 100, ISO, 250 or at least 500 contigaous nucleotides or annuo 
add residues in iCTgdL Fragments may be preferaitiany selected firomotttainregjLons of a molec^ 
S For exanQ>le» a polypeptide fragoKaotniay comprise a certain le^^ 

selected from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a ; 
certain defined sequence. Qearly these lengths are exeniplary, and any l«efhttmt is ra^ 
die specificatimi, including tiie SequCTce Listing, tables, and figures, may be raoonqpassed by the 
present raibodimaits. 

10 A firagmeot of SEQ ID NO:2 can comprise a re^on of unique polynucleotide sequ^ice tiiat 

specificaUy identifies SEQ ID NO:2, for exanqxle, as distinct fiomaioy othm' sequence in the genome 
from which die fiagmrait was obtained. A fragmmt of SEQ ID NO:2 can be en^loyed in one or more 
CTabodin^nts of methods of the invention, fox exaxq)le, in hybridization and anqilification 
technologies and in analogous methods that distinguish SEQ ID NO:2 from related polynucleotides. 

15 The pcecise lex^ of a fragn^t of SEQ ID NO :2 and the region of SEQ ID NO:2 to which die 
fiagmsnt corresponds are routinely det^nunable by one of ordinaiy s^ in die art based on the 
int^ided purpose for the fragment 

A fragment of SEQ ID NO:l is encoded by a fragment of SEQ ID NO:2. A fragment of SEQ 
ID NO:l can conQ>rise a region of unique amino acid sequence that specifically identifies SEQ ID 

20 NO:L For 6xan^)le, a fi-agment of SEQ n:)NO:l can be used as an immunogenic peptide for the 
development of antibodies that specifically recognize SEQ ID NO:l. The precise length of a 
fragm^t of SEQ ID NO:l and die region of SEQ ID NO:l to which the firagment corresponds can be 
detercmned based on the intended purpose for the fragment using one or more analytical mediods 
described herein or otherwise known in the ait. 

25 A *'fiill length" polynucleotide is one containing at least a translation initiation codon (e.g., 

noBthiomne) fidlowed by an open reading frame and a translation tenmnation codon. A '^futt length** 
polynucleotide sequmce encodes a "full length" polypeptide sequence. 

"Homology** refers to sequence similarity or, altecnatively, sequence identity, between two or 
more polynucleotide sequences or two or more polypeptide sequences. - . - - - " 

^ 30 The terms "percent identity** and **% identity,** as applied to polynucleotide sequences, refer 

to the percCTtage of identical nucleotide matdies between at least two polynucleotide sequences 
aligned using a standardized algorithm. Such an algorithmmay insert, in a standardi29ed and 
reprodudble way, gaps in die sequences being con^ared in order to optimize alignment betwem two 
sequences, and thoreSbre achieve a more meaningfiil comparison of the two sequences. 

3S Percent identity between polynucleotide sequences may be detiaxmned using one or more 
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computer algoriOmis or prograimfaM>wn in tte art or descnTiedhe^ For example, peicent idiatfity 
can be detennmed usiflg the default parameters of the CLUSTAL V algoriflim as kcoiporated into 
flieMBGALIGN version 3. 12e sequence aHgnmentpn,grani This program is part of the 
LASERGENE software package, a suite of molecular biological analysis programs (DNASTAR. 
5 MadisonWD. CLUSTAL Vis descifted in Higgins. D.G. and P.M Sharp (1989; CABIOS 5:151- 
153) and in Higgins, D.G. et al. (1992; CABIOS 8:189-191). For pairwise aKgnmaits of 
polymdeotide sequences, the default parameters are set as follows: Ktuple=2. gap penaltyt=5, 
windows, and "diagonals saved"=4. TTie "wdghted" n«idue weight table is selected as flie default 
Altemativdy, a suite of commonly used and fredy avaflable sequence comparison algorithms 
10 i«*idi can be used is pmvided by the National Center for Biotechnology Information (NCBI) Basic 
Local Alignment Seardi Tool (BLASTS (Altschul. S.F. et al. (1990) J. Mol. Biol. 215:403-410). 
Mihich is avaflable fiom several sources, inchiding the NCBI, Befliesda. MD. and on liie Intemet at 
ncbi.ritauiih.gov/BLAST/. The BLAST software suite inctades various sequence analysis programs 
inclading "blastn." fliat is used to align a known polynucleotide sequence wifli other polynucleotide 
15 seqoMcesfiomavarietyofdatabases. Also available is a tool called "BLAST 2 Sequences" that is 
used for firectpairwise comparison oftwonndeotide sequences. "BLAST2 Sequaices" canbe 
accessed and used inleractiveay at iicbi.ntaLnaLgov/goff/bl2.htmI. The "BLAST 2 Sequences" tool 
canbeusedforboaiblastnandblastp(discussedbeIow). BLAST programs a« commonly used with 
gap and other parameters set to default settings. For example, to compare two nucleotide sequ«ices 
20 o«™y«»blastnwi,hthe«BLAST2Sequ«K«»»toriVersion2.0.12(April.21-^^^ 
parameters. Sueih default parametas may be, for example: 
Matrix: BLOSUM62 
Reward fornuuch: 1 
Penalty for mismatch: -2 

25 Open Gap: 5 and Extension Gap: 2 ixnalties 

Gap X drop-off: 50 
Expect: 10 
Word Size: 11 

Filter: on _ _ , _ _ 

Percent identity may be measured over the lei^ of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for exainple, ' 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at ' 
least 20, at least 30, at least 40. at least 50. at least 70. at least 100, or at least 200 contiguous 
nucleotides. Such lengflis are exemplaiy only, and it is understood that any fragment 
supported by ihe sequences shown herein, in ti» tables, figures, or Sequence Listiflg, may be used to 
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desciibe a l^igSi over vMch p^rceutag^ identity may be measured* 

Nucleic acid sequeoces fliat do not show abigh degree of identity may oevertbeless encode 
sjinQaraimiui add sequences due to Ibedeg^iBiacy of the gei^ Itisund^stoodtfaatdianiges 
in a nucldc acid seqaooce canbe made using diis degenmcy to produce mnltqple nucldc acid 

5 sequCTces fliat all encode substantiaHy die same protein. 

The phrases ^'pocent identity^ and ideati^/* as applied to po^^pqptide sequences, teiex to 
die p^pceotage of identical residue matches between at least two polypq;ytide sequences aUgned usixig 
a standardized algoiiflun. Methods of poIypq>tide sequrace dignmrait are weQ-known. Some 
alignnKUt n^thods take into account conservative amino acid substitutions. Sucb cons^vative 

10 substitutions, explained in more detail above, general^ preserve llie chargp and hydnq»bobicily at the 
site of substitution, Ifaus preservii^ tbe structure (and therefore ftanction) of ibe polyp^tide. The 
0irases **percent similaii^' and *'% sindlarity,'' as applied to polypeptide sequences, refor to the 
peicatage of residue matches, including identical residue matches and conservative substitutions, 
between at least two polypeptide sequmces aligned using a standardized algorithm. Incontrast, 

IS conservative substitutions are not included in the calculation of percent idra[itily between polypeptide 
sequences. 

Percent id^itity between polypeptide sequences may be determined using the default 
param^^ of the CLUSTAL V algorithm as incorporated into tiie MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). For pairwise alignmmts of 
20 polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 
penalty^3, window^S, and "diagonals saved''=5. The PAM250 matrix is selected as^tiie d^ault 
residue weight table. ^ 

Alternatively the NCBl BLAST software suite may be used. For exanqde, for a pairwise 
conq>arison of two polypeptide sequences, one may use the '^BLAST 2 Sequences** tool Version 
25 2.0. 12 (April-21-2000) with blastp set at default parameters. Sucb default parameters n^y be, for 
exan^le: 

Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
Gap X drop-off: 50 - - ^ — " 

30 Ejqpect: 10 

Word Size: 3 
Filter: on 

Percent idratity maybe measured over the length of an entire defined polypq)tide sequence, 
for 6xarxq)le, as defined by a particular SBQ ID nuinb», or may be measured over a shorter lengfli, for 
35 exan^le^ over the lengdi of a fragm^ tafcm from a laig^, defined polypeptide sequence, for 
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instance, a fragment of at least 15. at least 20. at least 30. at least 40, at least 50. at least 70 or at least 
150 contiguous residues. Such lengths are exen5)lary only, and it is underetood that aigr fragment 
length supported by the sequences shownherein. in the tables, figures or Sequence listii^, maybe 
used to describe a length over percentage ideaatity may be measured. 

"Human artificial diromosomes" (HACs) are linear microchromosomes ^vhich may contam 

DNA sequences of about 6 H, to 10 Mb in size and which contain an of the demails re^^ 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino add 
sequence in die non-antigen binding regions has been altered so that the antibody more dosdy 
resen*les a human antibody, and still retains its original binding ability, 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complemBntaiy strand fiinni£|ibase pairing under defined hybridization conditions. Specific 
hybridization Is an mdication Oat two nucleic acid sequences share a high degree of 
coinplementarity. Specific hybridization complexes form under permissive annealing conditions and 
i«iiainhybridized alter the "washing" step(s). The washing step(s) is particularly important in 
detOTninnig the stringency of the hybridization process, wifli more stringent conditions allowing less 
non-specific bindiiig. ie.. binding between pairs of nucleic add strands that are not perfectly 
matched. Permissive conditions for annealing of nucleic add sequences are routinely determinable 
by one of otdipaiy skill in the art and may be con»st«it among hybridization experiments, whereas 
wadi conditions may be varied amoi^ eixperiments to achieve the desired rt^ 
hybridization spedfidty. Pennissive annealing conditions occur, ifar example, at 68'C in die 
presence of about 6 x SSC. about 1% (w/v) SDS. and about 100 /tg/ml sheared, denatured sabmn 
spomDNA. 

Getierally. stripgency of hybridization is expressed, in part, with reference to flie temperature 
underwhichthewa8hstq,iscaniedout. Sud» wash tempaatures are typicafly sdected to be about 
5"C to 20-C lower flian die theraial mdling pdnt (T J for die spedfic sequence at a defi^ 
strength and pH. The T„ is the tisnqjerature (under defined ionic strength and pH) at whidi 50% 
thetaigetsequencehybridizestoaperfectlymatdiedprobe. An equation for calculating T„ and 
conditions for nucleic add hybridization are weD known and CMl»-fotind m-Sifi*rdd^; jTind D.Wr~ 
RusseP (2001 ; Molecular Cloninp- A I .aboratory Man^ial. 3rd ed.. voL 1-3. Cold Spring Harbor Prt»s. 
Cold Spring Harbor NY. ch. 9). 

High stringency conditions for hybridization between polynucleotides of the preseiit 
invention include wash conditions of 68«>C in the presence of about 0.2 x SSC and about 0.1% SDS. 
for 1 hour. Alternatively, ten^ieratures of about 65»C. 60»C. 55»C. or 42»C may be used. SSC 
concentration may be varied from about 0.1 to 2 x SSC. with SDS bdng present at about 0.1%. 
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T^picaUy. blocKng reagrats are used to block nonrspecafic ]q«iidizatiaiL Sucb blocldi^ teageats 
include, for instance, sheared and denatured salmon sperm DNA at about 100-200 /igAnL Organic 
solvent, such as ibnnamide at a concentration of about 35-50% v/v, may also be used under particular 
circumstances, such as for RNA:DNA hybridizations. Useful variations on fliese waiih conditions 
will be readily apparent to those of ordinary skill in the art Hybridization, particularly under Ugh 
stringency conditions, may be suggestive of evolutionary similarity between the nucleotides. Such 
simflarity is strongly indicative of a similar role for flie nucleotides and Ihear oicoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic adds by 
virtue of flie formation of hydrogen bonds between camplemaitaiy bases. Ahybridization compl« 
may be formed in sohition (e.g.. Cot or Rot analysis) or formed betweoi one nucleic acid present in 
sohition and another nucleic acid immobilized on a soHd support (e.g., paper, membranes, filters, 
chips, pins or glass sBdes. pr any other, appropriate substrate to vMch cells or thdr nuddc adds have 
beraifixed). 

The words "insertion" and "addition" refer to changes in an amino add or polynucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Dnmiuie response" can refer to conditions assodated with inflammation, trauma, immune 
disorders, or mfections or genetic disease, etc. These conditions can be characterized by expression 
of various fectors. e.g.. cytokines, dieniokines, and other signaling molecules, which may affect 
cellular and systranic demise systems. 

An "immunogenic fragment" is a polypeptide or oKgopeptide fragment of KPP which is 
capable of didting an immune response when introduced into a living organism, for example, a 
mammaL The term "immunogenic fragment" also includes any polypeptide or oUgopepti^ fragment 
of KPP whidi is useful in any of the antibody production methods disdosed herdn or kno^ in die 
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The term "ndcroarray" icBms to an arrangemeait of a phirality cKfpo^nudeotides. 
pdypeptides, antibodies, or oflier dMamcal conjMJunds cm a substrate. 

The terms "dement" and "array donent" refer to a polynudeotide. polypqitide. antibody, or 
other chemicd compound havii(g a unique and defined position on a micioarxay. 

The term "modiilate" refers to a diange in flie activity & KPP. Tot foiis^Ki^aSaa&^is^ 
cause an increase or a decrease in protdn activity, bindiflg characteristics, or aiqr other biological, 
functional, or immunological properties of KPP. 

The phrases "nuddc add" and "nuddc add sequence" refer to a nudeotide. oligonudeotide. 
polynucleotide.oranyfragmBntfliereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be singje-stranded or double-stranded and may iqiresent the sense or the 
antisense strand, to peptide nucldc add (PNA), or to any DMA-like or RNA-Iike material 
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"Operably linked" refers to liie situation in whidi a fiist nucleic acid sequence is placed in a 
fimctional relationship wifli a second nucleic acid sequence. For instance, a promoter is opeiably 
linked to a coding sequence if liie promoter affects flie transcription or expression of flie coding 
sequence. Operably linked DNA sequences may be in dose proximity or contiguous and. where 
necessary to join two protein coding regions, in the same reading ftame. 

"Peptide nucl«c acid" (PNA) refers to an antiseaise molecule or anti-g^ agent vi4ncih 
comprises an oligonucleotide of at least abont 5 nudeotides in lengft linked to a pep&Se baddxme of 
amino acid residues ending in lysine. The terminal lysine confers sohibility to the composition. 
PNAs prefeentially bind complementary single stranded DNA or RNA and stop transcript 
do^gation, and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an KPP may involve Upidation, ^ycosylation, 
phosphorylation, acetylation, racenrization, proteolytic cleavage, and other modifications known in 
the art Hese processes may occur synthetically or biocbemicany. Biodiemical modifications wiD 
. vary by cdl Qrpe dqiending on the oizymatic milieu of KPP. 

"Probe" lefos to nuddc adds encoding KPP, their cor^lements, or fragmeaits diereof. 
w*idi are used to delect identical. aUeUc or related nncldc adds. Probes are isolated 
oJifionudeotides or pidynadeotides attadied to a detectable labd or reporter molecule. Typical 
labels indnde radioactive isotopes, Kgands, dienuhmanescoit agents, and enzymes. "Primers" are 
sbmt nuddc adds, usually DNA oligpnucleotides. wUch may be annealed to a target polynucleotide 
by compkamaitaiy base-pairing. The primer may thenbe extended along flie target DNA strand by a 
DNA po^merase enzyme. Primer pairs canbeused fat amplification (and idenljficatbn) of a nucldc 
wad. e.g., by the polymerase diain reaction (PCR). 

Probes and primers as used in the piesent invention typicafly comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specifidly. longer probes and primers may also 
be employed, sudi as probes and primers tiiat comprise at least 20, 25. 30, 40. 50. 60. 70. 80. 90, 100. 
or atleast 150 consecntivenodeotidesoffliedisdosednuddc add sequences. Probes and primers 
maybe considerably longer tiian fliese examples, and it is understood tiiat aay lengtii supported by tiie 
specification, includiiig the tables, figures, and Sequoice Listiiig, noay be used. 

Methods for prqwring and using probes and primers are dsswribrf 

Sambrook, J. and D.W. Rns!^ (2001; Molecular aonin g: A Laboratnrv M,^ i.«i Srf ed.. vol. 1-3. 
Cold Spring Harbor Press. Cold Spring Harbor NY), Ansubel. FJM. et aL (1999; Short Protocols in 
Molecular Biolo r y . 4* ed., John Wiley & Sons. New YoricNY). and Innis, M. et aL (1990; PCR 
Protocols, A Guide to Meti^yds apd Applications , Academic Press. San Diego OA). PCR primer pairs 
can be derived from a known sequence, for example, by usir« computer programs intended for tiiat 
purpose sudi as Primer (Version 0.5. 1991. Whitdiead Institute for Biomedical Researdi. Canibridgp 
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Oligonucleotides for use as prints arc selected usipg softwaie kno^ in the art for such 
puipose. For example. OUGO 4,06 software is useful for the selection of PGR primer paire of up to 
100 nucleotides each, and for the analysis of oligonucleotides and lar^ po^nndeotides of up to 
5 5.000 nucleotides from an ii^ut polynucleotide sequence of up to 32 kOobases. Similar prinw 

selection programs have incorporated additional features for expanded capabifities. For exanq>le, die 
PrimOU primer selection program (available to the public from the Gcabm Crater at IMversily of 
Texas South West Medical Center. Dallas TX) is capable of dioosing specific prinws fiom 
megabase sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 
10 primer selection program (available to tiie public firomthe Whitehead Institute^MTT Center for 
Genome Research. Cambridge MA) aHows the user to input a "nrispriming hTjrary." in whidi 
sequences to avoid as primer bindmg sites are user-specified. PiimBr3 is useftil, in particular, for the 
selection of oligonucleotides for ndcroarrays. (The source code for die latter two primer selection 
programs may also be obtained fiom their respective sources and modified to nset tiie user's specific 
needs.) The PrimeGen program (available to the pubUc from flie UK Human Graome Mapping 
Project Resource Cmtre. Cambridge VK) designs primers based on multiple sequence aUgnments. 
thereby aHowing selection of primers that hybridize to dtiier tiie most conserved or least conserved 
r^ons of aligned nucleic acid sequences. Hence, this program is useful for identification of both 
unique and conserved oligonucleotides and polynucleotide fragments. The oligonucleotides and 
polynucleotide firagmeaits id^itified by any of the above selection methods are usefal m hybridization 
tec3inolo^es. for GKasrple, as PCR or sequencu^ primers, microarray elements, or specific probes to 
identify folly or partially complementary polynucleotides in a sample of nucleic acids. Methods of 
oligonucleotide sdection are not lumted to those described above. 

A ''recombinant nucl^c acid" is a nucldc add that is not naturally occurring or has a 
sequeaice fiiat is made by an artificial combination of two or more otherwise separated segments of 
sequence. This artificial combination is often acconq>lished by chendcal synthesis or, n»re 
commonly, by the artifidal manipulation of isolated segments of nncldc adds, e.g., by ^snetic 
engineerii^ techniques such as tiiose described in Sanobrook and Russefl (/supra). The term 
reconoibinant indndes nucldc adds that have bem altered solely by addition, substittttibh,^l[detiMr 
of a portion of the nucleic add. Frequentiy, a recombmant nucldc acid may include a nudeic acid 
sequence operably linked to a promotiar sequence. Such a recombinant nucldc acid may be part of a 
vector that is used, for example, to transform a cell. 

Altemativdy, such recombinant nucldc adds may be part of a viral vector, e.g. , based on a 
vaccinia vims, fliat could be use to vaccinate a mammal wherein the reconiinant nucleic acid is 
expressed, inducii^ a protective immunological response in the fnanm^i 
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A ''regidatoiy etement** refers to a nnddc atid sequence usually dmvcd finxm untranslated 
xenons of a g^e and includes enbanceis, promoteis, intnms, and S and 3' untranslated i^ons 
(UTRs). Regulatory elements intmct widi host or viral protdns viiidh control transcription, 
translation, or RNA stability. 

5 Hepoiter molecules** are cbramcal or biocbemical nxnedes used for labding a nucldc acid, 

anmoadd^ or antibody. Rq[>ortBrxnDlecules include radionuclides; enzymes; fluoresce^ 
diranluminBscrat, or duombgenic agents; substrates; cofactors; inhibitors; magnetic partides; and 
odier moieties known in the art. 

An ''RNA equivalent,** in refer»C6 to a DNA molecule, is conposed of die same linear 

10 sequence of nucleotides as the reference DNA molecidewidi die »cepti^ 

xutrogenous base thynune are replaced with uracil, and die sugar bacld>one is con^xised of ribose 
instead of deoxyribose. 

Hie t»in''sanq>le** is used in its broadest seose. A sample suspected of containing KPP, 
nucldc acids ^icodijqg iOPP, or firagments thereof may contprise a bodily fluid; an extract from a cdl, 

15 cSuomosome, organdie, or membrane isolated fiom a ceD; a ceU; genonnc DNA, RNA, or cDNA, in. 
solution or bound to a suhstrate; a tissue; a tissue piim^ 

The t^ooK "qiedfic binding^ and "specifically binding*' refer to diat interaction between a 
protdn or p^tide and an agonist, an antibody, an antagodst, a small molecule, or any natural or 
^xrihetic binding coiiq)Osition. Hie int^action is dependent upon the presence of a particular 

20 structure of die protein, e.g. , the antig^c detercdnant or epitope, recognized by the binding 
molecule. For exaiiq>le, if an antibody is specific for epitope "A," the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled in a reactiom containing free labeled A 
and the antibody will reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified'* refers to nucleic acid or amino add sequ^ices that are 

25 removed fromthdr natural environment and are isolated or separated, and are at least about 60% free, 
preferably at least about 75% free, and noost preferably at least about 90% free from oth^ 
conqponents with which they are naturally associated. 

A '"substitution" refers to the replacem^ of one or more amino acid residues or nucleotides 
by difierent anmio acid residues or nucleotides, respectivdy. - 

30 "Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 

chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
rmcroparticl^ and capillaries. The substrate can have a variety of surface forms, such as wells, 
tcimches, pins, channels and pores, to which polynucleotides or polypq^des are hound. 

A "transcript image*' or **cxptessum profile" refers to the collective pajttem of gene 

35 expression by a particular ceD type or tissue under given conditions at a given time. 
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^Transfonmtion'* describes a process by vSuxix exogenous DN A is introdaced into a xecipienl 
cdl. Tiaosfanmtionniayocciiriuiderimtiindor a 

wen koowa in flie ait, and may rely on any known method for the insertion of fordgn nucldc add 
seqpeiicesintoapfolcaryolicoreulcafyoticliostcdL Hie method for transformation is selected based 
5 ' on ttie type of liost ceU bdng transformed and may incihide, but is not linuted to, bacteriophage or 
viral infection, etectroporation^beat shock. Iqiofection, and particle bombardment The tenn 
"transformed cdls** inclades stably transformed cells in wbich the ins^led DN A is capable of 
replication ather as an autonomously replicating plasnud or as part of the host duomosome, as wdl 
as tran^enfly transformed oeDs i^di' express the inserted DNAor RNA for limited periods of time. 
10 A **transgenic organism," as nsedherein, is any orgamsm, including but not limited to 

animals and plants, in which one or more of the cells of the oiganism contains heterologous nucleic 
acid introduced by way of human intervention, sudi as by transgenic techniques well known in fhe 
act The nucleic acid is intiodnced into die cell, directiy or indirecfly by introduction into a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection widi 
15 a recombinant idnis. In anotiier enibodim^ the nucleic acid can be introduced by infection with a 
lecoinbinant viral vector, sudi as a lentivual vector (Lois, C* et al. (2002) Science 295:868-872). The 
t^m genetic manipulation does not include clasdcal cross-breeding, or in vitro fertilization, but 
ratiier is directed to die introduction of a reconffimiant DNA molecule. The transg^c organisms 
oontemptated in accordance wifli die present invention include bacteria, cyanobactma, fimgi, plants 
20 and animals. The isolated DNA of the present invention can be introduced into die host by methods 
known in the art, for example infection, transfection, transformation or transconjugatioiL Techniques 
for transferring the DNA of the preset invention into such organisms are widely known and 
provided in references such as Sambmok and Russell (jsvprd), 

A ''variant' of a particular nucldic acid sequence is defined as a nucleic acid sequ^ce having 
25 at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequMK:es using blastn with the "BLAST 2 Sequeaices" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at I6ast 97%, at l^ast 98%, or at least 99% or greater 
30 sequence identity over a certain defined length. A variant may be described as, for exanqple, an 

"allelic'* (as defined above), "splice," "species/* or "polymorphic" variant A splice variant may have 
significant identity to a reference molecule, but win generally have a greater or lesser number of 
polynucleotides due to alternate splicing during mRN A processing. The corresponding polypeptide 
may possess additional functional domains or lack domains that are present in the reference molecule. 
35 Species variants are polynucleotides that vary from one species to anoth^. The resulting 
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polypeptides will giem^aHy have significant aminD acid identity relative to eacb other. A 
polyii»rphic variant is a variation in the polyoucleotide sequCTce of a paiticular geoe between 
individaals of a ^ven species. PoIycoDrphic variants also noay ^sconq^ass "^ngle nucleotide 
polymorphisms** (SNPs) in which the polynadeotide seqaence varies by one nucleotide base. Ihe 

5 pxesenceof SNP& may be indicative of, for esanqfle, a certain population, a dise 
propensity for a disease state. 

A varianT of a particular polyp^tide seqa«ce is defined as a polypeptide sequaice havtog 
at least 40% sequence idootity or sequ^ice similarity to the pafticular potypeptide sequCTce over a 
c^tain length of one of the polypeptide sequCTces using blastp with tiie '*BLAST 2 Sequences" tool 

10 Version 2.0.9 Q4ay-07-1999) set at defetdt parameto. Such a pair of pd^e^tides may show, for 
example, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, 
or at least 99% or greater sequence identity or sequence sin&larity over a certain darned lei^gth of one 
of the polypeptides. 

15 

THEINVENTION, 

Various mbodia^sAs of the invention include new human kinases .and phosphatases (KPP), 
the polynucleotides encoding KPP, and the use of these conq)ositions for the diagnosis, treatment, or 
prevention of cardiovascular diseases, imnmne system disorders, neurological disorders, disord^ 

20 affecting growth and development, lipid disorders, cell proliferative disorders, and cancers. 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
embodiments of the invention. Each polynucleotide and its corresponding polypeptide are correlated 
to a single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is 
denoted by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an 

25 Incyte polypeptide sequence nundber (Incyte Polypeptide ID) as sho^MO. Each polynucleotide 

sequence is denoted by both a polynucleotide sequm^e identification nurnber (Polynucleotide SEQ 
ID NO:) and an Incyte polynucleotide consensus sequence nundber (Incyte Polynucleotide ID) as 
shown. 

Table 2 ^ows sequences with homology to polypeptide embodiinents of the invention as 
30 identified by BLAST analysis against the GenBank protein (genpept) database and the PROTEOME 
database. Columns 1 and 2 show the polypeptide sequence identification number (Polypeptide SEQ 
ID NO:) and the corresponding Incyte polypeptide sequence number (Incyte Polypeptide ID) for 
polypeptides of the invention. Column 3 shows the GenBank identification number (GenBank ID 
NO:) of the nearest GenBank honoolog and ttie PROTEOME database identification numbm 
35 (PROTEOMEIDNO:)oftiieneaiestPROTBOMEdataba5ehomDlog5. Column 4 shows tiie 
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probability scores for the matches between each polypeptide and its bomologCs). Cohinm S shows the 
annotation of the GenBank and PROTEOME database homologCs) along with relevant citations 
y/t^sm applicable, all of vMch are expressly incorporated by referrace hradn. 

Table 3 shows various stractnral features of the polypeptides of the inventioD. Cohunns 1 
5 and 2 show the polypeptide sequ^ce identification nuidber (SEQ ID NO:) and the corresponding 
locyte polypeptide seqaeoce noniber OIncyte Polypqitide ID) for each polypeptide of the inv^ition* 
Colunm 3 shows the number of anuno acid residues in each polypqyti Colunm 4 shoix^ annno 
add residues conQirising dgoature sequences, dom^bs, raotife, potential phosphorylation sites, and 
potential gl^cosylation dtes. Cohunn S shows analytical no^ods for protdn structure/function 
10 analysis and in some cases, searchable databases to which the anailytical nsethods were applied. 

Toge&er, Tables 2 and 3 summarize the prop^es of polypeptides of the invmidon, and these 
prop^es establish thai the dainosd polypeptides are kinases and phosphatases. For esan^le, SEQ 
ID NO:l is 98% idmtical, firomresidue Ml to residue S449, to human protdn phosphatase 4 
r^Ulatory subunit 2 (GeoBank ID g82S0239) as det^rxEoned by the Basic Local Alignment Seardi 
15 Tool (BLAST). (See Table 2.) The BLAST probability score is L4&241, i^di indicates the 

probabili^ of obtaining tb& observed polypeptide sequence alignment by diance. SEQ ID NO: 1 also 
has homology to human protein phosphatase 4 r^ulatoiy subunit 2, as deteramned by BLAST 
analysis usixig the PRCyTEOME database. The foregoing provide evidence that SEQ ID NO:l is a 
protdn phosphatase r^;idatQrysubumt The algorithms and parameters for the analysis of SEQ ID- 
20 NO:l are described in Table 7. 

As ^own in Table 4, full length polynucleotide enobodiments were assembled using cDNA 
sequCTces or coding (exon) sequences derived firom genonnc DNA, or any combination of these two 
types of sequences. Column 1 lists flie polynucleotide sequrace identification number 
(Polynucleotide SEQ ID NO:)» the corresponding Incyte polynucleotide consensus sequence number 
25 (Incyte ID) for each polynucleotide of the invention, and the length of each polynucleotide sequence 
in basq>airs. Column 2 lists fragments of the polynucleotides v^ch are useful> for exan^le, in 
' hybridization or amplification technologies that identify SEQ ID NO:2 or that distinguish between 
SEQ ID NO:2 and related polynucleotides. Column 3 shows identification nunibers corresponding to 
cDNA sequences, coding sequences (exons) predicted fromgenomicDNA* and/or sequence 
30 assemblages con^rised of both cDNA and genome DNA. These sequences were used to assenible 
the full length polynucleotide enibodiments. Columns 4 and 5 of Table 4 show the nucleotide start 
(SO and stop (3^ positions of the cDNA and/or genomic sequences in column 3 relative to their 
respective full length sequences. 

The identification nunibers in Column 3 of Table 4 may refer specifically, for exanqple, to 
35 Incyte cDNAs along mth their corresponding cDN A libraries. For exanq>le, 8245726H1 is the 



37 



EP.1722P 

identification nuniber of an locyte cDNA seqaence, and BONEUNROl is the cDNA library from 
wMch it is derived. IncytecDNAs for which cDNA libraries are not indicated were d»^ 
pooled cDNA libraries (e.g., 70744891 VI). Altetnattvely, flie identification nondbexs in cohunn 3 
n&y refer to GenBank cDNAs or BSTs (e.g., g384631 1) which contribttted to the asseanbly of the fall 

5 Iraglh polynucleotides. In addition, the identification nondbersmcolunm 3 noayidei^ 

d^ved from the ENSEMBL (The Sangra* Ceotre, Cambridge, UK) database (Le. , those sequences 
including die designation *'ENST^. Alternatively, the identification nundbers in column 3 may be 
derived from die NCBI RefSeq Nucleotide Sequence Records Database (jLe. , those sequences 
including die designation "7414" or "NT') or the NCBI RefSeq Protein Sequence Records (ie., ttiose 

10 seqaraiK^es including the designation ^'NP'O- Altmiativdy, the identification nnmbers in column 3 
nmy refo to assemblages of both cDNA and G^canrpreificted exons brought togedier by an "esLon 
stitching" algoriflmi. For exan?>le, VLJDOOOaUfiJ^2jnnnnrjfsJf4 represents a "stitched" 
sequCTce in which XXXXXX is the identification number of the clost^ of sequences to \if4iich the 
algorithm was applied, and YYYYY is the nuniber of die prediction gCT^ted by the algorithm, and 

15 Ni;2^^^ if present, r^resent specific exons that may have bem manually edited during anaty^ (See* 
BKanple V). Altemativety, the identification numbers in column 3 may to assemblages of 
exons brought togedier by an "exon-stretdui^** algorithm. For exan^le, 

¥lX3DODDCjB/iAAAAjsJBBBBB^\Jf is the identification number of a "stretched" sequence, ^iSi 
JQOKXarbdng the Incyte project idmtification numb^, gMAAA being the GenBank identification 

20 number of die human g^umnc sequence to ^ift^ algorithm was applied, 

gJBBBBB bdng the GenBaxik identification number or NCBI RefSeq identification nunibCT of the 
nearest GraBahk protein homolog, and iVrefeniiig to spec^ Ininstances 
where a RefSeq sequence was used as a protein homolog for the ^^exon-stretching'* algoritiun, a 
RefSeq identifier (denoted by "NM,** "NP,** or "NT") may be nsed in place of the GenBank identifier 

25 (i.e., gPBBBBy 

Alternatively, a prefix id^tifies component sequences that were hand*edited, predicted from 
g^iomic DNA sequences, or derived from a coinbination of sequence analysis methods. The 
following Table lists exanqples of conqionent sequence prefixes and corresponding sequence analysis 
. methods associated with the prefixes (see Exan^le IV and Exanbple V). 



Prefix 


Type of analysis and/or examples of programs 


GNN, GFG, 
ENST 


Exon pi^diction from genomic sequences usmg, for exanqple, 
GENSC AN (Stanford Univ^ity, CA, USA) or FGENES 
(Compute Genomics Group, The Saiiger Centre, Cambridge, UK). 


GBI 


Hand-edited analysis of genonuc sequences. 


FL 


Stitched or stietched gmomic sequences (see Exanqple V). 
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Fall leagth transcript and exon predictioa £rom impping of EST 
sequences to the genonoe. Genonodc location and EST con^osition 
data aiecondbined to preset the exons and resulting transcript 



In some cases, Incyte cDNA coverage redundant with Oie sequence covmgo shown in Table 
4 was obtained to confinn fhe final consensus polynucleotide sequmce^ but the relevant Incyte cDN A 

5 Idrattficatioa munb^ are not shown. 

Table 5 shows the rqxcesentative cDNA libraries fen: those full l^igA polynucleotides which 
wexe assCTbled using Incyte cDNA sequences. The rqires^itative cDN A library is the incyte cDN A 
library vMkSi is most fireqaently represented by Ote bu^cDNAsequmces v/idck were used to 
assenftrle and confirm the above polyaucleotid^. The tissues and vectors vdiich were used to 

10 construct the cDN A libraries s3iown in Table 5 are described in Table 6. 

Table 8 shows single nucleotide polynxxrphisnis (SNPs) found in polynucleotide sequoioes of 
Reinvention, aloi^ with alMefirequendes in different liunmipo^ Columns 1 and2 show 

the polynucleotide sequence identification nu0fl>er (SEQ ID NO:) and the coiresponding Incyte 
project identification muriber (JPOJ) for po^^mucleotides of the invention. Cohinm 3 shows the Incyte 

15 identification nundber for the EST in which the SNP was detected (EST ID)» and column 4 shows tiie 
identification number for the SNP (SNP ID). Column 5 shows die position within (he EST sequence 
at whidh the SNP is located (EST SNP), and column 6 shows the position of the SNP vnMa the fidl- 
length polynucleotide sequence (CBl SNP). Column 7 shows die allele found in the EST sequence. 
Columns 8 and 9 show the two alleles found at die SNP site. Column 10 shows the amino add 

20 CTCoded by die codon including the SNP site, based upon the allele found in the EST. Cohunns 11- 
14 show the frequency of allele 1 in four di&rent human populations. Anentry of n/d (not detected) 
indicates diat the iErequency of allele 1 in the popcdation was too low to be detected, while n/a (not 
available) indicates tiiat die allele frequency was not determined for the population. 

The invention also encon^asses KPP variants. Various enibodiments of KPP variants can . 

25 have at least about 80%, at least about 90%, or at least about 95% amino acid sequence identity to the 
KPP amino acid sequence, and can contain at least one functional or structural diaracteristic of KPP. 

Various embodunents also caicompass polynucleotides which encode KPP. In a particular 
embodiment, the invention enconqpasses a polynucleotide sequence conqirising a sequence selected 
from fhe group consisting of SEQ ID NO:2, which encodes KPP. The polynucleotide sequences of 

30 SEQ ID NO:2, as presented in the Sequence Listing, embrace the equivalent RNA sequences, wherdn 
occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone is 
composed of ribose instead of deoxyribose. 

Tlie invention also encompasses variants of a polynucleotide encoding KPP. In particular. 
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such a variant polynucleotide will have at least about 70%» or altramtively at least about 85%, or 
even at least about 95% polymicleotide sequence id^otity to a polynucleotide encodtng KPP. A 
particular aspect of the inveaitiQii encoinpasses a variant of a polyandeotLde cc»q;irising a sequence 
selected from the group conasting of SEQ ID N02 wludihas at least about 70%, or altematively at 
5 least about 83%, or evrai at least about 95% polynudeolide sequence identity to a nuddc acid 
sequence sdected from the gnrapconsistiDg of SEQ ID NO:2. Any one of the polynucleotide 
variants described above can encode a polypqitide ^nUch contains at least one functional or structural 
diaracteristic of KPP. 

In addition, or in tiie altmiative, a polynucleotide variant of tiie invCTtion is a spHoe variant 

10 of a polynucleotide encoding KPP. Asplice variant may have portions which have significant 
sequence identity to a polynucleotide encoding KPP, but will generally have a greater or less^ 
nunober (rf nucleotides due to additions or deletions of blocks of sequence arisiqg from alternate 
splidng during noRN A processing. A splice variant may have less than about 70%, or alternatively 
less tiian about 60%, or altemativdy less than about 50% polynucleotide sequ»ce identity to a 

15 polynucleotide encoding KPP over its entire length; howev^, portions of the splice variant vnH have . 
at least about 70%, or attematively at least about 85%, or alternatively at least about 95%, or 
altemativdy 100% polynucleotide sequence ides3&ty to portions of the polynucleotide encoding KPP. 
Any one of die splice variants described above can encode a polyp^tide ^^ch contains at least one 
functional or structural characteristic of KPP. 

20 It will be appreciated by those sldlled in the art that as a result of the degeneracy of the 

genetic code, a multitude of polynucleotide sequences encoding KPP, some bearing minimal 
sinnlarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be nmde by selecting combinations based on possible codon choices. These 

25 combinations are made in accordance with the standard triplet genetic code as applied to tbo 

polynucleotide sequence of naturally occurring KPP, and all such variations are to be considered as 
being specifically disclosed. 

Although polynucleotides which encode KPP and its variants are generally capable of 
hybridizing to polynucleotides encoding naturally occurring KPP liiiaer appropriately selected 

30 conditions of stringency, it may be advantageous to produce polynucleotides encoding KPP or its 

derivatives possessing a substantially difTmnt codcm usage, e.g., inclusion of non-naturaUy occurring 
codons. Codons noay be selected to increase tiie rate at whidi expression of the peptide occurs in a 
particular prokaryotic or eukaryotic host in accordance with the frequency with "wUch particular 
codons are utilized by the host Other reasons for substantially altering tiie nucleotide sequence 
35 ^coding KPP and its d^vatives without altering tiie encoded anano acid sequences include the 
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piodoctiQn of RNA transcripts haviiig xxsore desirable propeities, sudi as a greater half-ltfe» tlian 

transcripts produced firom the naturally occurring sequence. 

The invesition also ^iconq)asses production of polynucleotides which encode KPP and KPP 

derivatives, or fragments diereof, entirdy by syndietic chendstiy. After production, the synthetic 
5 polynucleotide n^y be ins^ted into any of the many available expression vectors and cdl systems 

usiDgreageaxts well known in the art Moreover, synthetic chemistiy may be used to introduce 

nmtations into a polynucleotide encoding KPP or any fragment thereof. 

l&xdbodiments of the invration can also include polynucleotides lliat are capable of 

hybridizing to the cfainosd polynucleotides, and, in particular, to those having the sequoices shown in 
10 SEQ ID NO:2 and firagments thereof* und^ various conditions of stringency (Wahl, G.M. and S.L. 

Bwg^ (1987) M^dds EnzymoL 152:399-407; JShna^ A.R. (1987) Methods ^izymoL 152:507- 

51 1). Hybridization conditions, including annealing and wash cmiditions, are described in 

""Definitions/' 

Mediods for DNA sequwciog are known in flie art and imy be used to practice aiqr of 

IS die endbodinaaits of the invention, llxe niediods imy «]q>l0y such engines as the Kl»u>w fragn^^ 
of DNA potymefase I, SEQUENASE (US Biochemical, Cleveland OB), Taq polynaerase (Applied. 
Biosystems), tbermostable T7 polynomse (AmershamBiosciCTces, Piscataway NJ), or conibinations 
of polymerases and proofreading exonucleases sudi as tiiose found in the BLONG ASE anoplification 
system (invitiogen, Carlsbad CA). Preferably, sequence preparation is automated witii madiines such 

20 as the MICROLAB 2200 liquid transfer system (Handlton* Rem NV), PTC200 tiimnal cycles (MJ 
Research^ Wat^town MA) and ABI CATALYST 800 thermal cycler (^plied Biosystems). 
Sequmcing is tiien carded out using eifh^ the ABI 373 or 377 DNA sequencing syst»2 (Applied 
Biosystems), the MEGAB ACE 1000 DNA sequendng system (Amersham Biosciences), or other 
systems known in die art Theresultiiigsequeiu^esareattalyzednsingavarie^ of algorithms which 

25 axe wdl known in the art (Ausubel et al., supra, cIl 7; Meyers, R.A (1995) Molecular Biology and 
Biotechnology . Wiley VCH, New York NY, pp. 856-853). 

The nucleic acids encoding KPP may be extended utilizing a partial nucleotide sequence and 

en^loying various PCR-based methods known in the art to detect upstream sequences, such as 

promoters and regulatory elements. For exanqile, one method which'inay be enqiloyed, 

30 lestriction-site PCR, uses universal and nested primers to an^Ufy unknown sequence from genomic 
DNA within a cloning vector (Sarkar, G. (1993) PCR Methods Applic. 2:318-322). Another method, 
inverse PCR, uses primers that extend in divergent directions to amplify unknown sequence from a 
circularized teniplate. The teoqilate is derived from restriction fragments con9>rising a known 
genomic locos and surrounding sequences (Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186). A 
35 third mediod, capture PCR, involves PCR an^lification of DNA fragments adjacent to known 
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seqaCTces in huznan and yeast attificial duromosoooe DNA CLag^stnun. M. et al. (1991) PGR 
Meduidi Applic. 1:111-119). £afliisn3Blbod,nsiltq>lerestncdon«zym 
nmybensed to insnt an mgineered double-stranded sequ^ice into a xegion of unknown sequ^ice 
bdfofre pedbnmog PGR. Oflii^na^fhods whiGhinay be used to retrieve xin^^ 
S known in die art (Parb^, J.D. et al. (1991) Nucldc Adds Res. 19:3035-3060). Additionally, one n^y 
use PGR, nested prisoers, and PROMOTERFINDER Ubxaries (BD Glontech* Psdo Mto CA) to walk 
gCTomicDNA This procedure avoids die need to screen libraries and is us^id in fi^ 
joncdons. For aBPCR-basedineduids, primers nsry be deigned using coinmi^d 
software, such as OUGO 4.06 primrar analysis software ({Rational Biosciences, F^fnoouth MN) or 
10 anotbv appropriate program, to be alMKit 22 to 30 nucleotides inl^igdi, tohave a GG content of 
about 50% or noore, and to anneal to tibie teoqilate at teo^eratures of about 6S^C to 72*'G. 

Vfb&a. screening for full lengCh cDNAs , it is preferable to use libraries diat have been 
siz^selected to include larger cDN As. In addition, random-primed libraries, which often inchiite 
sequences containing the 5* regions of gaies, are preferable for situations in which an oligo dCT) 
IS library does not ^dd a fidl-l»gQi cDN A Gmonnc libraries inay be usefid for ext^ion of sequence 
into 5* non-transcribed regulatory re^ons. 

CapiDary electrophoresis systems which are commercially available maybe used to analyze 
the size or confirm the.nudeotide sequence of sequencing or PGR products. In particular, capillary 
sequendng may employ flowable polymers for electrophoretic separation, four different nucleotide- 
20 specific, laser-stimulated fiuoresc^t dyes, and a charge coupled device camera for detection of the 
eimtted wavdengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process firom loading of sanities to con9>uter analysis and electromc data display may be computer 
controlled Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
25 . which may be pres^ in limited amoimts in a particular sample. 

In another embodiment of the invention, polynucleotides or fragments thereof which aotcode 
KPP may be cloned in recombinant DNA molecules that direct expression of KPP, or fragmmts or 
functional equivalents thereof, in appropriate host cdls. Doe to the inherent degeneracy of the 
genedc code, other polynucleotides which encode substantially the'san^ of VfiSoctibnany e^val^ 
30 polypeptides may be produced and used to express KPP. 

The polynucleotides of the inv«tion can be ei^gineered using methods gmmdly known in 
the art in order to alt^ KPP*encoding sequences for a variety of purposes incluifing, but not limited 
to, modification of the doiuiig, processing, and/or expression of the gme product DNA shuSBing by 
random fragmentation and PGR reassembly of gene fragments and synthetic oligonucleotides may be 
35 used to engineer Oie nucleotide sequences. For exanqple, oligonncleotide-mBdiated site^directed 
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mutagCTesis nmy be used to introdQce imtations that create new lestiictioii sites, klt^ glycosylaticm 
patterns, change codon preference, produce splice variants, and so forlh. 

Hie nucleotides of die present invention xnay be subjected to DNA shuSIing techniques sudi 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No. 

5 5,837,458; Chang, C.-C, et al. (1999) Nat Biotechnol. 17:793-797; Christians. RC et al. (1999) Nat 
BiotechnoL 17:259-264; and Crameri, A. et al. (1996) Nat Biotechnol. 14:315-319) to alter or 
inq>rove die biological properties of KPP, siich as its biological or em^maiic activity or its ability to 
bind to other molecules or conq>ounds. DNA shufiBing is a process by which a library of gene 
variants is produced using PCR-mediated reconobination of gene firagments. Ibe library is then 

10 subjected to selection or s^ening procedures diat identify those gene variants with die desired 

properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shufOing and selection/scre^ung. Urns, genetic diversity is created duougji ''artificial" 
breeding and rapid molecular evolution. For exanqple, fragments of a single gene containing random 
point mutations may be reconabined, screened, and then reshufBed until the desired properties are 

15 optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in tbe same gene family, eidier from the same or different species, thereby 
mflTSfniy.ing die genetic diversity of mniltiple naturally occurring genes in a directed and controllable 
manner. 

In another embodiment, polynucleotides encoding KPP may be synthesized, in i^ole or in 

20 part, using one or more chenucal methods well known ra the art (Caruthers, M.H. et al. (1980) 

Nucleic Acids Symp. Ser. 7:215-223; Horn, T, et al. (1980) Nucleic Acids Synqp. Sen 7:225-232). 
Alternatively, KPP itself or a fragment thereof may be synthesized using chemical methods known in 
the art For exaniple, peptide synthesis can be p^formed nsio^ various solution-phase or solid-phase 
techniques (Creighton, T. (1984) Proteins. Structures and Molecular Properties , WH Freeman, New 

25 York NY, pp. 55-60; Roberge, J.Y. et al. (1995) Science 269:202-204). Automated synthesis may be 
achieved using the ABI 43 1 A peptide synthesizer (Applied Biosystems). Additionally, the amino 
acid sequence of KPP, or any part thereof, may be altered during direct synthesis and/or cpnibined 
with sequences from other proteins, or any part thereof, to produce a variant polypeptide or a 
polypeptide having a sequence of a naturally occurring polypeptide^ 

30 The peptide may be substantially purified by pr^arative high performance liquid 

diromatography (CHiiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421). The 
conqmsition of the synthetic peptides may be confirmed by anuno acid analysis or by sequencing 
(Creighton, supra, pp. 28-53). 

In order to express a biolo^cally active KPP, the polynucleotides encoding KPP or 

35 dmvatives thereof may be inserted into an appropriate mpression vector, a vector which contains 
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flie oecessaiy elemsnts for transcripttonal and transladonal control of flie inserted codBng sequmce in 
a suitable host These dCTrats include legolatoiy seqnoces, sudi as ^ihancers, constitutive and 
inducible piomotm, and S' and 3' untranslated regions in fhe vector and in polynucleotides ^coding 
KPP. Such dements nnyvaiy in dieirstrengdi and specificity. Spedfic initiation signals noay also 
S be nsed to adueve more effident translation of potynucleotidesCTCod^ Such signals include 
flie ATGimtiationcodon and adjacent sequences, e.g. the Kozak sequence. In cases wheie a 
potynucleotide sequence ^coding KPP and its iniliation codon and upstream regulatoiy sequences 
are inserted into the appropriate expression vector, no additional transcriptional or translational 
control signals noay be needed. However, in cases where only coding sequence, or a fragment 
10 dimof , is inserted, exogenous translational control signals including an in-frame ATG initiation 

codon should be provided by die vector. Exogenous translational el^nraxts and iidtiation codons may 
be ofvarions origins, both natural and synthetic. Tbeeffidmcy of expression may be eidianced by 
die inclusion of enhancers appropriate for the particular host odl systrai used (Scharf, D. et aL (1994) 
Results ProbL Cell Differ. 20:125-162). 
IS Methods which are well known to those sldlled in the art may be used to construct expression 

vectors containing polynucleotides encoding KPP and appropriate transcriptional and translational 
control dements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 
and in vivo genetic reconobination (Sambrook and Russell, supra^ cfa. 1-4, and 8; Ausubd et al., 
supra, di. 1, 3, and 15). 
20 A variety of expression vectorVhost systems may be utilized to contain and express 

polynucleotides encoding KPP. These indude, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, plasimd, or cosmid DNA expression vectors; 
yeast transformed with yeast expression vectors; insect cell systems infected with viral expression 
vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., 
25 cauMower moseuc virus, CaMV. or tobacco mosaic virus, TMV) or whh bacterial expression vectors 
(e.g., Ti or pBR322 plasimds); or animal cell systems (Sambrook and Russell, supra; Ausubel et al., 
supra; Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. 
(1994) Proa Natt. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hutol Gene Ther. 7:1937- 
1945; Takamatsu, N. (1987) EMBO J. 6:307-31 1; The McGraw Hill Yearbook of Science and 
30 Technologv (1992) McGiaw Hill, New York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. ScL USA 81:3655-3659; Harrington, JJ. et al. (1997) Nat. Genet 15:345-355). 
Expression vectors derived from retioAdruses, adenoviruses, or herpes or vaccinia viruses, or from 
various bacterial plasmids, may be used for delivery of polynucleotides to ttie targeted organ, tissue, 
or cell population (Di Nicola, M. et al. (1998) Cancer Gen. Ther. 5:350-356; Yu, ML et al. (1993) 
35 Proc Natl. Acad. Sci. USA 90:6340-6344; BuUer, R.M. et al. (1985) Nature 317:813-815; McGregor, 
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D,P. et al. (1994) Mol, hnmuuol. 31:219-226; Venna, LM. and N. Somia (1997) Nature 389239* 
242). The inventioa is not Ikcdted by the host cdl ^xqployed. 

In bacterial systems^ a mindber of cloinng and expression vectors may be selected depending 
upon the use intended for polynucleotides encodiflig KPP. For exanyde, routine clomr^, subcloidi^» 
5 and propagation of polynucleotides oicoding KPP can be achieved using a miildfunctional K coU 
vector such as PBLUESCRDPT (Stratagene, La Jofla CA) or PSPORTl plasnodd (Ihyitsogen). 
ligation of polynucleotides needing KPP into tbe vector's multiple cloning site disrupts the ladZ 
geaie, aUoiving a colonmetdc scxeeniDg procedure for identification of transformed bacteria 
c mifaitiing rerATtihinat^t mnleciileg. In addition, these vectx)rs may be useful fixT in v&ro trauscripti 
10 dideoxy sequencing, dngle strand rescue with helper phage, and creation of nested ddetions in ttie 
cloned $equ»ce (Van Heeke^ G. and S.M. Sdmstcar (1989) J. BioL Chena. 264:5503-55(»). When 
large qaantities of KPP are needed, e.g. for tbe production of antibodies, vectors which direct hi^ 
level.expressl<ni of KPP may be used. For exanq>le, vectMS contaiiuxis the strong, indudble SP6 or 
T7 bactmophage promoter may be used. 
IS . Yeast expression systraos may be used for production of KPP. A nondber of vectors 

contmning constitutive or inducible promoters, such as alpha foctor, alcohol oxidase, and PGH 
promoters, may be used in tfie yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such 
' vectors direct either the secretion or intraceDtdar retention of expressed protdns and raable 

integration of fordgn polyaadeotide sequences into the host graome for stable propagation (Aosubel 
20 et al., rapra; Bittw-, G.A. et aL (1987) MeAods EnzymoL 153.316-344; Scoter, CA. et al. (1994) 
Bio/Tedmology 12:181-184). 

Plant systons noay also be used for ^pression of KPP. Transcription of polynucleotides 
encoding KPP noay be driven by vind promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination wifli the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
25 6:307-31 1). Alternatively, plant pronx>ters such as the small subunit of RUBISCO or heat shock 
pron»ters may be used (Coruza, Q. et al. (1984) EMBO J. 3:1671-1680; Brogjie, R. et al. (1984) 
Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105). These 
constructs can be introduced into plant cells by direct DNA transformation or pathogen-mediated 
transfection (The McGraw Hill Yearbook of Science and Technology (1902) McGraw Hfli, New 
30 YorkNY. pp. 191-196). 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovims is used as an expression vector, polynucleotides encoding KPP may be ligated 
into an adenovirus transcripdon/transladon con^lex consisting of tbe late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to 
35 obtain infective virus which expresses KPP in host cells (Logan, J. and T. Sh^ (1984) Proc. NatiL. 
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Acad. ScL USA 81:3655-3659). In addition, transcription enhancers, such as the Rous saicooia virus 
(RS V) enhancCT. may be used to increase expression in mammfliian host cells. S V40 or 1^ V-based 
vectors may also be used for high-levd protdn expression. 

Hunoan artificial chromDsomes (HACs) may also be employed to deliver larger fragments of 
DNA flian can be contained in and expressed from a plasnrid. HACs of about 6 Kb to 10 Mb are 
constiucted and ddiv^ via conventional ddiveiy mefliods (Uposomes, polycationic amino 
polymers, or vesicles) for flierapeutic purposes (Hanington. JJ. et al. (1997) Nat Genet 15:345-355). 

For long tarn production of recombinant proteins in mammaUan systems, stable expression 
of KPP in ceil lines is prefared. For example, polynucleotides encoding KPP can be transformed 
into ceU lines using expression vectors whieih may contain viral origins of replication and/or 
oidogiaious expression elements and a selectable marker gene on the same or on a separate vector. 
Fonowing the introductionof the vector, odls maybe aHowed to grow for about 1 to 2 days in 
enridied media before being switiihed to selective media. The purpose of the selectable marker is to 
confer resistance to a selective agent, and its presence aDows growth and lecoveiy of cells vAich 
successfully cixpress Oe introduced sequences. Resistant dones of stably transformed cells may be 
propagated using tissue culture tecSmiqnes qtpropriate to the cell type. 

Aiqrimn*er of sdectionsystrans nay be used to lecovM-ttansfiMmBd cell lines, these 
indude, but are not limited to, flie herpes sinopleac virus tiqinidinB kinase and adenine 
phosphoribosyltnuisferase gtaies. for use inrtr and <?>r cells, respectively (Wigler, M. et aL (1977) 
Cen 11223-232; Lowy. I. et aL (1980) Cdl 22:817-823). Also, antimetabolite, antibiotic, or 
hetbidderesistancecanbeusedasfliBbasisforsdection. For exainple.<«^confi»s resistance to 
mefliotrexate; neo CQofiHrs resistance to tiie aminoglycosides neoinycin and G-418; and and /»» 
confer resistance to ddorsulfiuon and pihospiiinotricin acelyltransferase, respectively (Wigte. M. et 
al. (1980) Proc. Nafl. Acad. Sd. USA 77:3567-3570; Colbeie^arapin. F. et al. (1981) J. MoL BioL 
150:1-14). Additional sdectable genes have been described, eg.. /ipB and hisD. whidi alter odhilar 
requirements for metaboUtes (Hartman. S.C. and R.C. Mulligan (1988) Pttjc. NatL Acad. Sd. USA 
85:8047-8051). Visible markers. e.g., anfliocyanins. green fluorescent piotdns (GFP; BD Qontedi), 
p-glucuronidase and its substrate p-ghicuronide. or hiciferase and its substrate hidferin may be used. 

These markers can be used not only to identify transfoimants. but also to quantify flfis luhounnrf 

transient or stable protein expression attributable to a specific vector system (Rhodes. C.A. (1995) 
Metiiods Mol. Biol. 55:121-131). 

Alfliou^ the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirnied. Pbr example, if the 
sequence encoding KPP is inserted wifliin a marker gene sequaice, transformed cells containing 
polynucleotides encoding KPP can be idoitified by the absence of marker gene function. 
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Atteniatively, a inarker geoB canbe placed in tao^ 

control of a single promoter. Expression of die inaiio^ gene in response to induction or 
usually indicates expression of the tandem geue as welL 

In geaxml, liost cdls diat contain the polynncleotide encodii]^ KPP and . diat escpress KPP noay 
5 beideotUiedby a varietyof prooedoreslmovmtodioseof sldUinflm These procedures inclnde, 
but are not linnted to, DNA*DNA or DNA-RNA Iqrbridizations, PCR an^lification, and protein 
bioassay or immunoassay techniques ^vbicSbi inchide membranfi» solotiony or chip based tedmologies 
for the detection and/or quantification of nncleac acid or protein sequeoces. 

Immunological methods for d^ectixig and measuripg the expression of KPP using tidier 
10 specific polyclonal or mom^donal antibodies are known in the art Exanoples of sudi teciimques 
include enzyme-linked immnnosorbmt assays (BUS As), radioimnmnoassays (RIAs), and 
fluorescence activated cell sorting (PACS). A two-^te, monoclonal-based immunoassay utilizizig 
monoclonal antibodies reactive to two non-interfisring epitopes on KPP is preferred, but a conq^tive 
binding assay may be enqployed. These and oth^ assays are wdl known in the art (Hanq>tQn,R.etal. 
15 (1990) SCTologicai Methods, a Laboratory Manual . APS Press, St Paul MN, Sect IV; Coligan, J.E. et 
aL (1997) Current Protocols in Immunology . Greene Pub. Associates and ^WHey-Inteiscience, New 
YodcNY; Pound, J.D. (1998) Immunochemcal Protocols. Humana Press, Totowa NJ). 

A wide varie^ of labels and conjugation techniques are known by those skilled in the art and 
maybe used in various nucleic acid and ansno acid assays. Means for producing labeled 
20 hybridization or PCR probes for detecting sequences related to polynucleotides encoding KPP include 
oligolabeling, nick translation, end-labeling, or PCR an^lification using a labeled nucleotide. 
Alternatively, polynucleotides encoding KPP, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
25 such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted tising a variety 
of commercially available kits, such as those provided by Amersham Biosciences, Promega (Madison 
WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease of 
detection include radiormclides, enzymes, fluorescent, chemilunnnBScent, or chromogouc agents, as 
well as substrates, cofactors, inhibitors, magnetic particles, and the'like. 
30 Host cdls transfonwd with polynucleotides encoding KPP may be cultured under conditions 

suitable for die expresdon and recoveiy of the protdn from cell culture. The prot^ produced by a 
transformed cell maybe secreted or retained intracellularly depending on the sequ^ice and/or fhe 
vector used. As wiU be und^stood by diose of sidll in die art, expression vectors containi^ 
polynucleotides which encode KPP may be dedgned to contain signal sequences which direct 
35 secretion of KPP through a prokaryotic or eukaiyottc cell membrane. 
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In addition, a host ceU strain may be chosen for its ability to modulalB expiession of the 
inserted polynucleotides or to process the expressed protein in the deshed fashioa Sudi 
modifications of the polypqrtide inctode. but aie not limited to, acetjdation. carboxylation. 
gjycosylation. phosphorylation, Hpidation, and acylation. Post-translational processiflg vAich cdeaves 
a "piepro" or "pro" form of the protein may also be used to spediyprotdn targeting, foldii^g. and/or 
activity. I>ifferenthostcellswMchhavespecificcdlularmachineryandcAaracteristicme<^^ 
for post-tianslational activities (e.g.. CHO. HeLa. MDCK, HEK293. and WI38) are available fiom the 
Am«ican Tjrpe Culture Collection (ATCC. Manassas Va) and may be chosen to ensure the correct 
modification and processing of the foreign protsan. 

In another embodiment of the invention, natural, modified, or recombinant polynucleotides 
encoding KPP may be ligated to a heterologous sequence resulting in translation of a fusion protdn in 
aiiyoffl»afo«ai«itionedhostsystems. For example, a chhneric KPP protein containing a 
heterologous moiety fliat can be recognized by a commercially available antibody may facihtate the 
scieenfa^ofpeptideh^rariesforiiihi^itorsofKPPactivity. Heterologous protein and peptide 
moieties may also faciUtate puiificatian of flision proteins using commerciaHy available affinity 
matrices. Sudi moieties inchide.bm are not Kmited to. glutathione S-transfera^ 
bindmg protein (MBP). thioredoxinCnx), cateiDdulin binding peptide (CBP). 6-His, FLAG, c^myc. 
andhemaggJutinin(HA). GST. MBP. Tn. CBP, and 6-His enable purification of thdr cog^te fusion 
protrinsonimmobilizedghitafliione. maltose, phaiyhirsine oxide, cahnodulm, and metalKielate 
iesins.iespectiv«dy. FLAG, c-iwyc, and heinagglutinin (HA) «iableiinmiinoafiBm 
fiision protdns using commeicially avaih*le monoclonal and polyclonal antibodies that specifically 
recqgnizefteseepitopetags. A fiision protdn may also be cajgineeied to contain a proteolytic 
deavagp site located between the KPP encodiiig sequence and the hetemlogous protein sequence, so 
ihatKPPmaybecIeavedawayfiromtheheterologousmoietyfollowiqgpurificati^ Methodsfw 
fiisionproteinexpressionandpnrificatiottarediscussedinAiisubdetaL (j«pni,c^ A 

variety of commeroially available Mis may also be used to fiuafitate «piession ^ 
fiision {ootons. ( 

In another embodin«ait. synthesis of radiolabded KPP may be achieved in vitro using fhe 
TOTrabbftreticulocytelysateorwheatgermextractsyst*m(Pr»me^^^ These sj^S^-couple - 
transcription and translation of protein-codmg sequoices opnably associated with the T7. T3, or SP6 

promoters. Translation tafas place in the presence of a radidabded amino acid piwursw^ ^ 
example. ^*S-methionine. 

KPP, fragments of KPP, or variants of KPP may be used to screen forconipounds fliat 
specifically bind to KPP. One or more test compounds may be screened for specific bindiflg to KFP. 
In various embodiments. 1, 2, 3. 4. 5, 10, 20. 50. 100. or 200 test compounds can be screened for 
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specific binding to KPP. Eiiamples of test oonqyoimds can include antibodies, andcalins, 
oligonacleotides, piotdns (eg., ligands or receptors), or small molecules. 

In related embodiments, variants of KPP can be used to screen for binding of test compounds, 
such as antibodies, to KPP, a variant of KPP, or a combinaticm of KPP and/or one or more variants 
5 KPP. In an embodiment, a variant of KPP can be used to screen for conqiounds that ^ 

of KPP, but not to KPP having the exact sequence of a sequmce of SEQ ID NO:l. KPP variants used 
to perfcnm such screwing can have a range of about 30% to about 99% sequence identity to KPP, 
widi various embodiments having 60%, 70%, 7S%, 80%, 8S%, 90%, and 95% sequence identity. 
In an einbodimient, a cooqpound idratified in a screen.fi9r specific binding to KPP can be 
10 closely related to flie natural ligand of KPP, e.g., a ligand or fragment diereof , a natural substrate, a 
stroctural or functional mimetic, or a natural binding partner (Coligan, J.E. et al. (1991) Current 
Protocols in. ImmunolofEV l(2):Chapt^ S). In another embodiment, die concqpound thus identified can 
be a natural ligand of a receptor KPP CHoward, A.D. et al. (2001) Trends Pharmacol. Sd.22: 132-140; 
"Wise, A. et al. (2002) Drag I>iscovery Today 7:233-246). 
15 in odier enibodiments, a compound identified in a screen for specific binding to KPP can be 

closely related to the natural receptor to ^ch KPP binds, at least a firagment of die receptor, or a 
fragment of die receptor including all or a portion of the ligand binding site or bindiiig pocket For 
exan^le, die confound m^ be a receptor for KPP which is capable of propagating a signal, or a 
decoy receptor for KPP which is not capable of propagating a signal (Asbkena^, A. and V.M. Divit 
20 (1999) Curr. Opin. Cell BioL 1 1 :255-260; Mantovani, A. et aL (2001) Trends ImmunoL 22:328-336). 
The compound can be rationally designed ming known techniques. Examples of such techniques 
include those used to construct the conq)ound etanercept (ENBREL; Amgen Inc., Thousand Oaks 
CA), which is efficacious for treating rheumatoid arthritis in humans. Etanercept is an engineered 
p75 tumor necrosis factor (TNF) receptor dimer linked to the Fc portion of human IgG j (Taylor, P.C. 
25 et al. (2001) Curr. Opin. Immunol. 13:61 1-616). 

In one embodiment, two or more antibodies having similar or, alternatively, different 
specificities can be screened for specific binding to KPP, fragments of KPP, or variants of KPP. The 

binding specificity of the antibodies thus screened can thereby be selected to identify particular 

^ . firagments or variants of KPP. In one embodiment, an antibody can be selected such fliat its binding 
30 specificity allows for preferential identification of specific fragments or variants of KPP. In another 
embodiment, an antibody can be selected such that its binding specificity allows for preferential 
diagnosis of a specific disease or condidon having increased, decreased, or otherwise abnormal 
production of KPP. 

In an embodiment, anticalins can be screened for specific binding to KPP, fragments of KPP, 
35 or variants of KPP. Anticalins are ligand-binding pmteins that have been constracted based on a 
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lipocalin scaffold (Weiss, GA. and H.B. Lowman 0000) Chem. Biol. 7:R177-R184; Skerra, A. 
QOOl) J. Biotechnol. 74:257-273). The protein afchitectnie of Iqiocalins can include a beta-banel 
having eight andparallel beta-strands, Which supports four loops at its open end. These loops fonn 
the natural ligand-bindiog site of the lipocalins, a site which can be re-engineered in vitro by amino 

5 acid substitutions to impart novel binding spedficilies. Ihe amino acid substitutions can be made 
using mediods known in die art or described herein* and can include conservative substitutions (e.g., 
substitutions tiiat do not alter binding specificity) or substitutions that modesfly , moderately, or 
significantiy alter binding specificity* 

In one embodiment, screening for compounds which specifically bind to, stimulate, or inhibit 

10 KPP involves producing appropriate cells which express KPP, either as a secreted protein or on the 
cell menibrane. Preferred cells can include cells from mammals, yeast, Drosophila, or £1 colL Cells 
expressing KPP or cell membrane fractions which contain KPP are tiien contacted with a test 
compound and binding, stimulation, or inhibition of activity of either KPP or the conq>ound is 
analyzed. 

IS An assasy may simply test binding of a test conq>ound to the polypeptide, wherein binding is 

detected by a fluomphore, radioisotope, enzyme conjugate, or other; detectable label. For exaxnple, 
the assay may comprise the steps of combining at least one test componnd widi KPP, either in 
solution or affixed to a solid support, and detecting the binding of KPP to the compoimd. 
Alternatively, the assay may detect or measure binding of a test con^imd in the presence of a 

20 labeled competitor. Additionally, the assay may be carried out using cell-firee preparations, checoical 
libraries, or natural product mixtures, and die test coiqpoundCs) may be free in solution or affixed- to a 
solid support. 

An assay can be used to assess the ability of a componnd to bind to its natural ligand and/or 
to inhibit die binding of its natural ligand to its natural receptors. Bxanqdes of such assays include 

25 radio-labeling assays such as those described in U.S. Patent No. 5,914,236 and U.S* Patent No. 

6372,724. In a related embodiment, one or more amino acid substitutions can be introduced into a 
polypeptide* compound (such as a receptor) to improve or alter its ability to bind to its natural ligands 
(Matdiews, DJ. and J.A. Wells. (1994) Chem. Biol. 1:25-30). In another related embodiment^ne or 
r more amino acid substitutions can be introduced into a polypeptide compoimd (such as a ligand) to 

30 improve or alter its ability to bind to its natural receptors (Cunningham, B.C. and J. A. Wells (1991) 
Proc. Nad. Acad. Sci. USA 88:3407*3411; Lowman, H.B. et al. (1991) J. Biol. C^iem. 266:10982- 
10988). 

KPP, fragments of KPP, or variants of KPP may be used to screen for compounds that 
modulate the activity of KPP. Sudi compounds may include agonists, antagonists, or partial or 
35 inverse agonists. In one embodiment, an assay is performed under conditions permissive for KPP 
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activity, ^Sieran KPP is combioed witti>at least one test cmnpouDd, and the activity of KPP in the 
presence of a test conqmind is cQoq[>ared iMth die activity of KPP in the abs^ice of the test 
conqxound. A diange in the activity of KPP in the piesence of Oie test confound is indicative of a 
.conqpoundtiiatmodulatestheactivity of KPP. AUematively, a test conqiauiid is conibined mtii an in 

5 vUro or ceU-firee ^st^ comprising KPP muter conditions suitable for KPP activity, and the assay is 
petfonned. b& dtiier of these assays, a test cQii9)oand i^ch modulates ft^ 
indbectly and need not coBoe in direct contact with die test confound. At least one and up to a 
plorali^ of test conqwunds may be screened. 

Li anothi^ embodimeDt, polynndeotides mcoding KPP or thdr mammalian homologs may be 

10 'Imocked out** in an ammal mode] system using homologous leconibination in eiAiyonic st^ O^S) 
ceDs. Such techniques are wdl known in the art and are useftd for the genenitio^ 
human disease (see, e.g., U.S. Patent No. 5,175,383 and U.S. Pat^ No. 5,767,337). For e?cample, ' 
mouse ES cells, sudi as the mouse 129/SvJ ceQline, are dmved from the eady mouse embryo and 
grown in culture. The ES cells are transformed wth a vector containing the gme pf int^^t disrupted 

IS by a nmrkrar g«e, 6.g., tiie neomycin phosphotransferase gme (neo; Capecdii, M.R. (1989) Science 
244:1288-1292). The vector integrates into the corresponding re^on of the host graome by 
homologous recombination. Alternative^, homologous recombination takes place using the C^loxP 
system to knockout a gene of intnest in a tissue- or developmental stage-specific manner (Marth, J.D. 
(1996) Clm. mvest 97:1999-2002; Wagner, K.U. et al. (1997) Nud^c Acids Res. 25:4323-4330).* 

20 TransfonnedEScdIs are identified and microii^ected into mofusecdlblastoi^ste 

die C57BL/6 mouse strain. Hie blastocysts are surgically transferred to pseudopregnant dams, and 
the resulting chimi^c progeny are gCTotyped and bred to produce heterozygous or homozygous 
strains. Transgenic anin^ thus generated may be tested with potential therapeutic or toxic agents. 
Polynucleotides encoding KPP may also be manipulated in ^itro in ES cells derived firom 

25 human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm^ mesoderm, and ectod^mal cell types. Hiese cell lineages difft^entiate 
into, for example, neural cells, hematopoietic lineages, and cardionQrocytes (Thomson, J. A et al. 
(1998) Science 282:1145-1147). 

Polynucleotides encoding KPP can also be used to create "knocldn" humanized animals 

30 (pigs) or transgenic animals (mice or rats) to model human disease. Widi kaocMn technology, a 
region of a polynucleotide encoding KPP is injected into animal ES cells, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the 
blastidae are implanted as desmbed above. Transgenic progeny or inbred lines are studied and j 
treated with potmtial pharmaceutical agents to obtain infionnation on treatment of a human (fisease. 

35 Alternatively, a mammal inbred to overexpress KPP, e.g., by secretirLg KPP in its milk» may also 
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serve as a co&vem»t smuce of fhat pioteia (Jaime, J. et al. (1998) Biotechsol. Aomi. Rev. 4:S5-74). 
THERAPEUTICS 

Chemical and stnictixFal similaiity, e.g., in ttie context of sequences and niDti£s> GsdsXs 
betwe^ legions of KPP and kinases and phosphatases. In addition, exan^les of tissues expressing 

S KPP can be found in Table 6. Therefore, KPP appears to play a lole iia cardiovascular diseases, 
immune s^tem disordm, neurolo^cal disorders, disorders affecting growdi and devdopment, lipid 
disordears, cell proliferative disorders, and cancm. In fhe treatment of disorders associated widi 
increased KPP expression or activity, it is desirable to decrease die expre^on or activily of KPP. In 
the treatment of disorders associated wifh decreased KPP expression or activity, it is desirable to 

10 increasetheexpressionoractivity of KPP. 

Therefore, in one embodiment, KPP or a firagment or denvative diereof may be administeied 
to a subject to treat or prevent a disorder associated mlh decreased expression or activity of KPP. 
Examples of such disorders include, but are not linuted to, a cardiovascular disease such as 
aiteriovCTOUs fisbda, adierosclerosis, hypertension, vasculitis, Raynaud^ disease, aneurysms, aiterial 

IS dissections, varicose vdns, tiirombop(hlebitis and phlebotGhrombosis, vascular tumors, and 

conqilications of thronibolysis, baDoon angioplasty, vascular replacement, and coronary artery bypass 
graft surgeiy, congestive heart foihire, isGhraouc heart disease, an^na pectoris, myocardial infarction, 
hypertensive heart disease, degen^tive valvular heart disease, calcific aortic valve stenosis, 
cojjgexutaUy bicuspid aortic valve, mitral annular calcification, nntral valve prolapse, iheuroatic fever 

20 and iheumatic heart disease, infective endocarditis, nonbactmal tfaronfl)otic endocarditis, ^idocarditis 
of systemic lupus erythematosus, carcinoid heart disease, carfionayopatiiy, noyocarditis, pericarditis, 
neoplastic heart disease, congeautal heart disease, and complications of cardiac transplantation, 
congenital lung anomalies, atelectasis, pulmonary congestion and' edema, pulmonary embolism, 
puhnonary hemorrhage, pulmonary infarction, pulmonary hypertension, vascular sclerosis, 

25 obstructive pulmonaxy disease, restrictive puhnonary disease, chmnic obstmctive pulmonary disease, 
^xqphysema, chrome bronchitis, bronchial asthma, bronchiectasis, bacterial pneuinonia, viral and 
niycoplasmal pneumoiua, lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, 
pneumoconioses, sarcoidosis, idiopathic pulmonary fibrosis, desquamative interstitial pneumonitis, 
hyp^sensitivity pneumonitis, pulmonary eosinophilia bronchiolitis dbliterans-organizing pneumonia, 

30 diffuse pulmonary hemonhage syndromes, Goodpasture's syndromes, idiopathic pulmonary 
heoQOsiderosis, pulmonary involvement in collagen-vascular disorders, pulmonary alveolar 
proteinosis, lung tumors, inflammatory and noninflammatory pleural effusions, pneumothorax, 
pleural tumors, drug-induced lung disease, radiation-induced limg disease, and complications of lung 
transplantation; an immane system disorder such as acquired inomunodeficiency syndrome (AIDS), 

35 Addison's disease, adult respiratory distress syndrome, aller^es, ankylosing spondylitis, an^loidosis. 
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aneama, asflunay atheroscleFOsiSy ant omimtine hemolytig anenna, autoimoniiie thyroiditis, autDixnmane 
polyendocrinDpafiLy^^asdidiasis-ectodei^^ dystrophy (APBCED), bronchitis* cholecystitte, contact 
dermatitis, Ordm*s disease, atofic dennatitis, domatoniyositis, diabetes nodlitus, raqpihysenia, 
qpisodic lynqpihop^a ^th lyDq)hocytotoxins, eiythioblastosis fistalis, eiydiema nodosuni, atrophic 
5 gastritis, glonoranlonqduitis, 6oo^astuie*s syndrome, gout. Graves' disease, Hashimoto's « 
thyroiditis, hypereosinophOia, irritable bowel syndrome, multiple sclerosis, myastbrada gravis, 
nc^ocardial or pericardial inflammation, osteoartiiritis, osteoporosis, pancreatitis, polymyositis, 
psoriasis, Rdt^'s syndrome, rheojmtoid arthritis, scleroderma, Sjdgren's syndrome, systemic 
anaphylaxis, systennc lupus erythematosus, systemc sclerosis, thrombocytopeoic purpura, ulc^ative 
10 colitis, uvdtis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
drculation, \iral, bacterial, fungal, parasitic, protozoal, and helnmnOiic infisctions, and trauma; a 
neurological disorder such as epikpsy, ischeimc cerebrovascular disease, sttoke, cmfbral neoplasms, 
Atzhdn^'s disease. Pick's disease, Huntington's disease, dementia, Parkinson's disease and other 
extrapyramidal disordm, amyotrophic lateral sclmsis and other motor neuron ffisordm, progressive 
IS neural muscular atrophy, retiiutis pigmentosa, hmditary ataxias, multiple sclnosis and other 
demyelinating diseases, bacterial and viral menh^tis, brain abscess, subdural en^yema, epidural 
abscess, suppurative intracramal throidbophlebitis, myelitis and radiculitis, viral central nisrvous 
system disease, prion diseases including kuru, Creatzfeldt- Jakob disease, and Gerstmann- 
Straussler-ScheinkBr syndrome, fatal faixnlial insomma, nutritional and metabolic diseases of the 
20 nervous system, neurofibromatosis, tuberous sclerosis, cerebeHoretina] heman^oblastomatosis, 
encephalotrigennnal syndrome, mental retardation and other developmental disorders of the central 
UCTVous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autononnc 
nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis, 
25 inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental 
disorders including mood, anxiety, and schizophrenic disorders, seasoi^ affective disorder (SAD), 
akathesia, amnesia, catatonia, diabetic neuropathy, tardive dysldnesia, dystonias, paranoid psydioses, 
postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degraeration, 
and fanslial frontoten^oral dementia; a disord^ affecting growtii'alid developnoent suclTas abtinic 
30 keratosis, arteriosclmsis, atiierosclerosis, bursitis, dnhosis, hq>atitis, mixed connective tissue 

disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycytheoua vera, psoriasis, 
primary tiuombocytbenua, renal tubular acidosis, anenua, Cushing's syndrome, achondroplastic 
dwarfism, Duchenne and Beck^ muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR 
syndrome (Wilms' tumor, aniridia, gCTitourinary abnormalities, and mental retardation), Snuth- 
35 Magpnis syndrome, myelod^splastic syndrome, hereditary mucoepitiielial dysplasia, hereditary 
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keratodeixmis, h&redStary nearopathies such as Charcot-Marie-Tooth disease and neurofibromatosis, 
liypothyroidism* hydroc^halus, sdzore disorders such as Syndenhanfs diorea and cerebral palsy, 
spina bifida, an^icephaly, cramoradiischisis, congenital gtauconoa, cataract, and sensorineoral 
hearing loss; a lipid cUsorder such as fatty liver, cholestasis, primary biliary cirdiosis, carnitine 

5 d^biency, carnitine palnntoyltransferase defici^icy, myoadenylate deannnase deficiency, 

hypertriglyceridenaia, lipid storage disorders such Fabry's disease, Gauch^*s disease, Nienoann- 
Pick^s disease, metachromatic leukodystrophy, adrenoleukodjrstrophy, GM2 gangliosidosis, and 
ceroid'lipoiuscinoas, abetalipoproteinenua, Tangier disease, hyperlipoprot^nemia, diabetes melUtus, 
lipodystrophy, lipoajoatoses, acate panrdculitis, dissenunated fat necrosis, adq>osis dolorosa, lipoid 

10 adr^oal hyperplasia, miTOTnal change disease, fipomas, ath^oscl^osis, hyparcholesterolCT:Ea, 

hyp^holesterolenda with hypertriglyceridemia, primary hypoalphalipoproteinenda, hypothyroidism, 
i»a] disease, liver disease, ledflunrcholest^rol acyltransferase deficiency, cerebEroteodinous 
xanthomatosis, sitosterolenda, hypocholesterolemia, Tay-Sachs disease, SandhofTs disease, 
hyperlipideima, hyp^lipemia, lipid noyopathies, and obesity; and a ceU proliferative disorder sudi as 

15 actinic k^tosis, arteriosclerosis, atherosderosis, bursitis, cinhosis, hepatitis, mixed connective 
tissue disease (MCTD), mydofibrosis, paro:3rysmal nocturnal hjemoglobinniia, polycythenua vera, 
psoriasis, prinmy tfirondbocyfliema, and csaacets including adesiocardnoma, leuk^onia, lymphoma, 
' melanoma, n^oma, sarcoma, teratocarcinoma, and, in particular, cancers of flie adrenal gland, 
bladder, bone, bone marrow, brain, breast, c^vix, colon, gall bladdv , ganglia, gastrointestinal tract, 

20 heart, kidney, liv^, lung, nmsde, ovary, pancreas, parathyroid, penis, prostate, salivary glaisds, skin, 
spleCT, testis, fiiymas, thyroid, uterus, leokemias sudi as mnlfiple myeloma, and lymphomas sudi as 
Hodgkin*s <fisease. 

InanoOieranbodimmt, a vector capable of «pressingKPP or a firagment or derivative 
thereof nmy be administered to a subject to treat or prevot a disorder associated widi decreased 
25 expression or activity of KPP induding, but not lindted to, those described above. 

In a furth^ enoibodiment, a composition conqirfeing a substantially purified KPP in 
conjunction witti a suitable pharmaceutical carrier may be adnanistared to a subject to treat or prevent 
a disorder associated with decreased eaqaression or activity of KPP induding, but not limitai to, ,tiios6. 
provided above. - t- . 

30 In still anothi^ embodimi^ an agonist vdiidi modulates the acti\ity of KPP maybe 

administered to a subject to treat or prevent a disorder associated with decreased expresrion or 
activity of KPP including, but not linuted to, tiiose listed above. 

In a furfli^ enntbodiment, an antagonist of KPP may be administered to a subject to treat or 
prevent a disorder associated wifli increased expression or activity of KPP. Examples of such 
35 disorders include, but are not limited to, those cardiovascular diseases, immune system disorders. 
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neiuological disord«s, disorders affecting growth and deveIopmi^[it» lipid disordm, cdl piolifmtive 
disorders, and cancers described above. In one aspect* an antibody which specificalfy binds KPP may 
be used dir^cQy as an antagoiust or indirectly as a targetiiig or deiiveay mechanism for bringti^ a 
pharmaceutical agent to cells or tissues which express KPP. 

5 In an additional CTobodiment, a vector expressing die con^l^nmt of the polynucleotide 

encoding KPP may be adimnistered to a subject to treat or prevent a disord^ associated with 
incieased expression or activity of KPP inclnffing, but not liimted to, diose described above. 

In oflier CTbodimeaits, aixy protein, agonist, antagonist, antibody, complem^itary sequCTce, 
or vector embodinsiits may be admnistered in cond^ination with oth^ appropriate therapeutic 

10 agCTts. Selection of the appropriate agents for use in combination therapy may be imde by one of 
ordinary skin in the art, according to conventional pharmaceutical principles. Tlie combination of 
th»apetttic agents may act syncrgisticalty to effect the treatm^ or prevention of the various 
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy widi 
lower dosages of each ag^ot, tims reducing the potential for adverse side effects. 

15 An antagonist of KPP may be produced using methods which are genially known in the .art 

In particular, purified KPP may be used to produce antibo^es or to screen libraries of phannaceutical 
agents to identify those vMcix specifically bind KPP. Antibodies to KPP may also be generated using 
methods that are well known in tiie art. Such antibodies may include, but are not limited to, 
polyclonal, nxinoclonal, chimeric, and sin^e chain antibodies. Fab fragments, and fragments 

20 produced by a Fab expression library. In an embodiment, neutralizing antibodies (i.e. , those which 
inhibit dimsr formation) can be used dierapeutically. Single chain antibodies (e.g., from camels or 
llamas) may be potent enzyme inhibitors and may have application in die design of peptide nnmetics, 
and in the development of immuno-adsorbents and biosensors (Muyldermans, S. (2001) J. BiotedmoL 
74:277-302). 

25 For the production of antibodies, various hosts including goats, rabbits, rats, imce, camels, 

dromedaries, llanms, humans, and others may be immunized by injection with KPP or with any 
fragment or oligopeptide thereof which has immunogenic properties. Depending on the host species, 
various adjuvants may be used to increase inomunological response. Sudi adjuvants iuclude, but are 
not linuted to, Freund*s, mineral gels such as aluminumhydroxide, and surface activelmbsliuDces sach 

30 as lysolecithin, pluronic polyols, polyanions, peptides, oil OTGoalsions» KLH» and dinitrophenoL 

Among adjuvants used in humms, BCG (badlli Calmette-Guerin) and Corynebacteriwn parvum are 
especially preferable. 

It is prefecred diat the oligopeptides, peptides, or fragments used to induce antibodies to KPP 
have an amino acid sequence consisting of at least about S amino acids, and geo^ally mil consist of 
35 at least about 10 annuo acids. It is also preferable that these oli^pq>tides, peptides, or fragments are 
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substantially idoiticd to a poition of file anmiD add sequ^ Short str^ches 

of KPP annno acids may be fused with fliose of aiio&er protdn, such as KLH, and antibodies to Ihe 
dumeiic nx>lecule may be produced. 

Monoclonal antibodies to KPP may be prepared using any technique vitddtk provides for Che 

5 production of antibody molecules by conlinnouscdll^^ These inchide, but are not 

linnted to, the hybridoma technique, the human B-ce31 hybridoma technique, and die EB V-hybridoma 
technique (Kohler, G. et aL (197S) Nature 256:493-497; Kozbor,, D. et aL (198S) J. bxmmnol. 
Melhods 81:31-42; Cote, R J. et aL (1983) Proc. Natl. Acad. ScL USA 80:2026-2030; Cole^ S.P. et al 
(1984) MoL Cell Biol. 62:109-120). 

10 la addition, techniques devdoped for die production of ^diimefic antibodies," such as the 

splicing of mouse antibody genes to human antibody gqnes to obtain a molecule vnSi appropriate 
antigm specified^ and biological activity, can be used (Morrison, S.L. et al. (1984) Proc. Natl. Acad. 
Sci. USA 81:6851-6855; Neuberger. M.S. et al. (1984) Nature 312:604^08; Takeda, S. et al. (1985) 
Nature 3 14:452-454). Altemativdy, techniques described for the production of singje chain 

15 antibodies may be adapted, using methods known in the art, to produce KPP-spedfic single chain 
antibodies. Antibodies with related specificity, but of distinct idiotypic conq)osition, may be 
generated by chain shuffling firom random combinatorial immunoglobulin libraries (Burton, D.R. 
(1991) Proc. Nati. Acad. ScL USA 88:10134-10137). 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 

20 popidation or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature (Qrlandi, R. et al. (1989) Proc. Natl. Acad. Sci USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299). 

Antibody fragments wMch contain specific binding sites for KPP may also be generated. For 
example^ such fragments include, but are not linnted to, F(ab')2 fragments produced by pepsin 

25 digestion of the antibody molecule and Fab fragments generated by redudng the disulfide bridges of 
the F(ab*)2 fi-agments. Altmmtively, Fab expression libraries may be constructed to allow rapid and 
easy idcaitification of monoclonal Fab fragmraits wilh the desired spedfidity (Huse, W.D. et aL (1989) 
Science 246:1275-1281). ^ 

Various immimoassays may be used for screwing to identify antibodies having ihe'd^ifed"' 

30 spedficiQr. Numerous protocols for con^etitive biiiding or intimanoradion^ 

polyclonal or monoclonal antibodies with established specificities are well known in die art Such 
immunoassays typically involve die measurem»it of complex formation between KPP and its specific 
antibody. A two-aite, noonoclonal-based immunoassay utilizmg monoclonal antibodies reactive to 
two non-interfering KPP e^ntopes is generally used, but a competitive bindiDg assay may also be 

35 enydoyed (Pound, supra). 
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Vaiions mefliods sadh as Scatdhard analysis in conjunctiQii witb radioimnmnaassay 
tedimqaes may be used to assess ^ afBsity of antibodies for KPP. AfBnity is expressed as an 
assodation constant. l^,^iiiGli is defined as the xnolarconceatration of KPP-antibo<ty 
divided by tbe molar concentrations of fcee antigm and firee andbody under equiHbrtiun conditions. 

5 The Ka detenmned for a preparation of polyclonal antibodies, ^ch at6b^m>geneous in didr 
afBnities for miiltqple KPP epitopes* rqpres^its (he average affinity, or avidhy, of the antibodies for 
KPP. The K^deteramned for a preparation of monoclonal antibodies, ^wbi^ 
particular KPP epitope, represCTts a true measure of afBnity. High*afBnity antibody preparations 
with K« ran^mg firom about 1 0^ to 1 0^ L/cnole are prefened for use in immunoass a^ in v/hich die 

10 KPP-antOio^con^lexniastmtiistandrigoroQsnaanqpu^ Low-afBni^ antibody preparations 
with K^ ranging from about 10^ to 10^ L/mole are preferred far use in inwnnnopmificatiQn and sinalar 
procedures wbicSi ultimately require dissociation of KPP, prefmbly in active form, firom die antibody 
(CatQr, D. (1988) Antiboffies^ Vohmae I: A Practical Approach. IRL Press, Washington DC; liddeil, 
J.R and A. Cry^ (1991) A Practical Guide to Monoclonal Antibodies. John Wiley St Sons, New 

15 York NY). 

The titer and aiddi^ of polyclonal antibody preparations may be furtlier evahiated to 
detenxnnediequaUty and suitabiliQr of such preparations for certain For 
exanqile, a polyclonal antibody preparation containing at least 1*2 rpg spedfic antibody/ml, 
preferably 5-10 ing specific antibody/nQl, is generally eofiployed in procedures requiring precipitation 

20 of KPP-antibody complexes. Procedures for evaluating antibody specificity, titer; and avidity, and 
guidelines for antibody quality and usage in various applications, are gen^aUy available (Catty, 
supra; Coligan et al. , supra). 

In another embodin^t of the invention, polynucleotides encodiog KPP, or any fragment or 
conq)lement tiiereof , may be used for tiierapeutic purposes. In one aspect, modifications of gene 

25 expression can be adbieved by designing con^l^nentary seqaences or antisense molecules (DNA, 
RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding 
KPP. Such tedmology is well known in the art» and antisense oligonucleotides or larger firagments 
can be designed fi-om various locations along the coding or control regions of sequences encoding 
KPP (Agrawal, S., ed. (1996) Antisense Therapeutics ^ Humana Press, Totawa NJ). '* ^ 

30 In therapeutic use, any gene delivery system suitable for introduction of the antisense 

sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellidarly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementaxy to at least a portion of the cellular sequence encoding die target protein (Slater, J.E. et 
al. (1998) J- Allergy Clin. Immunol. 102:469-475; Scardon, KJ. et al. (1995) FASEB J- 9:1288- 

35 1296). Antisense sequences can also be introduced intracelhilarly duough die use of viral vectors, 
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such as lettoviius and adeno-assodated vkos vectors (Miller, A.D. (1990) Blood 76:271-278; 
Austtbel et aL, supra; Uckeit, W. and W. Waldier (1994) Fhaimacol. Ther. 63:323-347). Otber gene 
delivery mechanisms include liposome^denved systems, artifidLal viral envelopes, and other systems 
Icnown in the ait (Rossi, JJ. (1995) Br. Med. Bull. 51 :217-225; Boado, R. J. et al. (1998) J. Fhatm. 

5 ScL 87:1308-1315; Morns, M.C. et al. (1997) Nucleic Acids Res. 25:2730-2736). 

In another oobodimBnt of flie inventicm, polyuucteotides racoding KPP may be used for 
somatic or gemdine gene flimpy. - Gene fhmpy may be p^fomoed to Q) correct a genetic defici^acy 
(e.g., in die cases of s&vm condbined immonodeficiency (S^C1D)-X1 disease characterized by X- 
linted inhoitance (Cavazzana-Calvo, M. et aL ^000) Science 288:669-672), sevm cond^ined 

10 immanodefici^icy syndrome associated vtiih an inhimted adenosine deaminase (ADA) deficiency 
(Blaese, R,M. et al. (1995) Sdmce 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis iZOmsr, J. et aL (1993) Cell 75:207-216; Crystal, R.O. et al. (1995) Hum Gene 
Therapy 6:643-666; Crystal, R.G. et aL (1995) Hum. <3ene Tb^apy 6:667-703), dudassannas, familial 
hyp»Aolesterolenua, and h^nq|AdIia resultiiie firom Factor Vm or Factor IX deficiencies (Crystal, 

15 R.G. (1995) Sci^ice 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (u) 
express a conditionally ledial gene product (e.g., in die case of cwsxss which result firom unregulated 
ce31 profiferation), or Cui) express a protdn wbidi affords protection against intracdliilar parasites 
(e.g., against human retroviruses, sudi as human immunodeficiency virus (HIV) (Baltimore^ D. 
(1988) Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Nati. Acad. Sd. USA 93:1 1395-1 1399), 

20 h^atitis B or C virus (HBV, HCV); fimgal parasites, such as Candida albicans and Paracoccidioides . 
brasiliensis; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzl)* &i the 
case where a g^ietic deficimcy in KPP expression or regulation causes disease, the expression of 
KPP firom an appropriate population of transduced cells may alleviate the climcal manijkstations 
caused by the genetic d^ciency. 

25 In a further exnbodiment of the invmtion, diseases or disorders caused by deficiencies in KPP 

are treated by constructing mammalian expression vectors encoding KPP and introducing these 
vectors by mechanical means into KPP-deficient cells. Mechanical transfer technologies for use widi 

ceQs in vivo or ex vitro include (i) direct DNA nncroinjection into individual cells, (ii) ballistic gold^ 

- particle delivery, (in) liposonoe-mediated transfection, (iv) receptor-niediated gene transfer, and (v) 

30 the use of DNA transposons (Morgan, R.A. and W.F. And^on (1993) Amm. Rev. Biochem. 62:191- 
217; Ivies, Z. (1997) Cefl 91:501-510; Boulay, J.-L. and H. R&ipon (1998) Curr. Opin. Biotedmol. 
9:445-450). 

Expression vectors that may be effective for the expression of KPP include, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOP0TA vectors 
35 Onvitrogen, Carlsbad CA), PCMV-SCRBPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), 
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and PTET-OFF. Fmr-ON. FrRE2. PTRE2-LUC. PTK-HYG (BD Qontech. Palo Alto CA). KPP 
imy be expressed using (i) a constitutively active promoter. (e.g.. from cytornegalovirus (CMV). 
Rods sarcoma virus (RS V). SV40 virus, thymidine kinase CTK). or P-actin genes). Cii) an inducible 
pniiiioter(e.g..llietetracycline-regulatedpronx,ter(Gossen.M andH.BujM^ Nafl 
Acad. ScL USA 89:5547-5551; Gossea. M. et aL (1995) Science 268:1766-1769; Rosd. F.M.V. and 
H.M. Blau (1998) Curr. Opin. BiotechnoL 9:45 1-456). commercially available in the T-REX plasmid 
Onvitrogen)); the ecdyson^inducible promoter (available in the plasmids PVGRXR and PIND; 
mvitrogHi); a» FKSOfi/rapamydn inducible promoter; or the RU486/inifepristone inducftle promoter 
(Rossi, F.M. V. and H.M. Blau. supra)), or (iii) a tissue-specific promoter or Oe native promoter of 
flie endogenous gene enbodmg KPP from a normal individuaL 

Commerciany available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECnON KIT. available from Hmtrogen) allow one xviflx ordinary skill in the art to deUver 
j)olynndeolides to target cdb in cuhuie and reqpiremm^ 

parameters. In alternative, transformation is peifoimedusii^ the calcium phosphate 
(Gniham. F.L. and A.J. Eb (1973) Virology 52:456-467). or by dectioporation (Neumann. E et al 
(1982) EMBO J. 1:841-845). Tl>e introduction of DNA to primary cdls requires modification of 
Oese standaMized manmialian tiansfection protocols. 

In another embodimrait of the mveolion. diseases or disord«s caused by g«^ 
respect to KPP expression are treated by constructing a letioviius vector consisting of (i) the 

pdynucleotideencodiug KPP under fl»contr«dofanindq«ndent promoter or ft^ 
tominal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and Cai) a Rev-responsive 
dai^nt (RRE) along with additional retrovirus cfa-acting RNA seq^awes and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stxatagene) and are based on published data (Riviere. L et al. (1995) Pioc 
Nafl. Acad. Sci. USA 92:6733-6737). mcoiporated by reference herein. The vector is propagated m 
an appropriate vect»r producmg cdl line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope piotdn such as VSVg (AimMtano. D. et aL 
(1987) J. ViroL 61:1647-1650; Bender. M.A. et aL (1987) J. ViroL 61:1639-1646; Adam, M.A. and 
AD. Miller (1988) J. ViroL 62:3802-3806; Dull. T- et al. (1998) J. ViroL 72:8463-847 l.-Zu^R: 
et al. (1998) J. ViroL 72:9873-9880). U.S. Patent No. 5.910.434 to Rigg C'Method for obtaining 
retroviras packaging ceH lines producing high transducing efficiency retroviral supernatant") 
discloses a method for obtaming retrovirus packaging cell hues and is hereby incorporated by 
reference. Propagation of retrovirus vectors, transductionof a population of cells (e.g., CD4*T. 
cells), and the return of transduced cells to a patient are prtK^edures wefl known to persons sMHedin 
the art of gene therapy and have been wdl documented (Ranga, U. et al. (1997) J. ViroL 71:7020- ' 
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7029; Bauer. G. et al. (1997) Blood 89:2259-2267; Bonyhadi. M.L. (1997) J. Virol. 71:4707-4716; 
Rai^ U. et al (1998) Proc. NaU. Acad. ScL USA 95:1201-1206; Su. L. (1997) Blood 893283- 
2290). 

In an onbodimrait. an adenoviius-based gene therapy delivery system is used to deliver 
polynucleotides encoding KPP to cells ^ch have one or more genetic abnormalities with respect to 
ttieexpressionofKPP. Tlieconstructionandpackagingof adenovirus-based vectors are weD known 
tolhosewithoidinaiyskinintheart RepKcation defective adenovirus vectors have proven to be 
versatile far importiag genes encoding immnnoregulatory proteins into intact islets in the pancreas 
(Csete. M.E. et al. (1995) Transplantation 27.-263-268). Potentially useM adenoviral vectors are 
described in U.S. Patent No. 5.707,618 to Annentano C'Adenovirus vectors for gene therapy"). 
her*yincoiporated by reference. For adenoviral vectors, see also Antinozzi. P.A. et al. (1999; Annu. 
Rev. Nntr. 19:51 1-544) and Venna. I.M. and N. Somia (1997; Nature 18:389.-239-242). 

In another eoftiodiment, a herpes-based, gene therapy ddlveiy systan is used to deUver 
polynucleotides encoding KPP to target cells wWdi have one or more genetic abnormalities with 
respecttoflieexpressionofKPP. The use of herpes simplex virus (HSV)-based vectors may be 
espeddly vahiable to introducing KPP to cells of the central nervous sys^ 
ttopism. 'Hieconstnictionandpackagingofherpes.basedveclwsarewenimowntolh^ 
ordinaiysHlinflieart AreplicationKam^wtentherpes siin0ex virus (HSV) type l-b^^ 
been used to deliver a reporter gene to the eyes of primates (Uu. X. et al. (1999) Exp. Eye Res. 
169:385-395). The construction of a HSV-l virus vector has also be«i disclosed in detafl in U.S. 
Patent No. 5.804.413 to Del^ C'Herpes simple virus slrams for gene transfer"), ^di is hereby 
incorporated by reference. U.S. Patent No. 5.804.413 tead.es the use of recombinant HSV d92 vOach 
consists of a genome containing at least one exogfflMus gene to be transfered to 
control of the appropriate promote for purposes indudiflg human gpnethaapy. Also taught by this 
patent are the constraction and use of recombinant HSV strains deleted for ICP4. ICP27 and ICP22. 
For HSV vectors, see also Coins, W.F. et al. (1999; J. Virol. 73:519-532) and Xu, R et al. (1994; 
Dev. Biol. 163:152-161). Tlienianipulation of cloned herpesvirus sequences, the generation of 
recombinant vims following the transfection of multiple plasmids containing difierent segmtaits of 
the large herpesviras genomes, the growth and propagation of herpesvirus, and the infection of cells - - 
wifli herpesvirus are techniques well known to those of ordmary skill in the art 

In another embodiment, an alphavinis (positive, singl^stranded RNA virus) vector is used to 
ddiver polynucleotides encoding KPP to target cdls. lUe biology of the prototypic alphavinis. 
SemBM Forest Virus (SFV). has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff. H. and K.-J. U (1998) Curr. Opin. Biotechnol. 9:464-169). During 
alphavinis RNA lepUcation. a subgenomic RNA is generated tiiat nonnally encodes the viral capsid 
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protdos. lUssubg^nonncRNA silicates to liigb^ 

lesoltiiig in flie ov^rodaction of ciapsid proteins relative to flie viral protons with enigmatic activity 
(e.g., protease and polymerase). Sinmlarly, inserting the codiiig sequCTce for KPP into the alpl^ 
gerome in place of flie cap^d-codipg region results in the production of a larg^ nDnA>er of KFP- 

5 coding RNAs and the synfliesisofhig^ levels of KPP in vector trai^ While alphaviius 

infection is typically associated with cdl lysis within a few days, the ability to establish a persistent 
infection mhamst^ nonnal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that 
die lytic replication of alphaviruses can be alt^ed to suit the needs of die gene therapy application 
^>ryga, S.A. et aL (1997) Virology 228:74-83). Hie wide host range of aljfliavirttses win allow die 

10 introdoctionofKPP into a variety of cell types. Tbespedfic transduction of a subset of cdDs in a 
population may require die sorting of cdls prior to transduction. The mettiods of manipulating 
infectious cDNA clones of alphaviruses* p^orming alphavirds cDN A and RNA transfections, and 
pealbnimg alphavirus infections, are w^ Imown to 

Oligonucleotides derived firom the transcription initiation site, e.g., b^ween about positions 

IS -lOand+lOfromthestartsite, may also be en^loyed to inhibit gene expression. Similarly, 

inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful 
because it causes inhibition of the ability of the double helix to open sufficientiy for the binding of 
polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances using 
triplex DNA have been described in the literature (Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, 

20 Molecular and Imnninoloedc Approaches . Futura Publishing, Mt Kisco NY, pp. 163-177). A 

complementary sequence or antisense molecule may also be designed to block translation of mRN A . 
by preventing the transcript £rom binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequcbce-specific hybridization of the ribozyroe 

25 inolecule to con[q>lementary target RNA, followed by endonucleolytic cleavage. Forexanqile, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of RNA molecules ^coding KPP. 

Specific ribozyme cleavage sites witldn any potential RNA target are imtially identified by 
scanidng the target molecule for ribozyme cleavage sites, including the followii^ sequencesr'GUAT ' 

30 GUU, and GUC Once idraitified, short RNA sequences of between 15 and 20 ribonudeotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondaiy structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization widi complemi^tary 
oligonucleotides using ribonudease protection assays. 

35 Conyilementacy ribonncleic add molecules and ribozymes may be pr^ared by any meOiod 
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loiown in flie art for the synthesis of imdei^ These inchide teduiiqaes for cheancally 

^nthesizixig oligaimcleotides such as solid phase phosphoraniidtte cheoucal synthesis. AltanatiVelyy 
RN A mDlecnles may he gemmted by in vitro and in vivo transcription of DN A molecules ^u»>dix]e 
KPP. SiidiDNAseqaencesmaybeincoiporatedintoa wide variety of vectors 

5 polymerase promoters sucb as T7 or SP6. Alt^nativdy, these cd3N A constructs dut syndiesize 
conqplemaotaty RN A, constitntivdy orindodbly, can be introduced into cdl lines, ceOs, or tissues. 

RNA molecules niiay be modified to increase intracdlolarstabiU^ and h^^ Possible 
modifications indude, but are not linsted to, the addition of fianlcmg sequCTces at the S* and/or 3 ' 
ends of the moleccde, or diense of phosphorothioate or 2* O-melhyl radier than phosphodiestmise 

10 linkages witfiin the backbcniB of the molecule. TUs concept is inherent in the production of FN As. 
and can be ^tended in aU of these nxdecules by the inchision of nontraditional bases such as inosine^ 
qneosine, and wybutosine, as well as acetyl*, methyl-, lliio-, and sinnlarly nxidified forms of a^tenine, 
cytosine, guaiune, thynune, and nracil^iiUcA are not as easily recognized by endogenous 
endonucleases. 

IS In other embodiments of theinvendon, the expression of one or noore selected 

polynucleotides of die present invention can be altered, inhibited, decreased, or silenced using RNA 
intecfer^ice (RNAO or post-transcriptional gene silencing (PTGS) methods known in the art. RNAi 
is a post-transcriptional mode of gene silencing in which double-stranded RNA (dsRN A) introduced 
into a targeted cdl specifically suppresses the expression of the homologous gene (i.e., the gene 

20 bearing the sequence con^l^nentary to the dsRN A). This effectively knocks out or substandaSy 

reduces the expression of the targeted gene. PTGS can also be accomplished by use of DNA or DNA 
fragments as welL RNAi methods are described by Fire. A et al. (1998; Nature 391:806-811) and 
Gura, T. (2000; Nature 404:804-808). PTGS can also be initiated by introduction of a 
conqplementary segnient of DNA into the selected tissue using gene delivery and/or viral vector 

25 delivery methods described herdn or known in the art 

RNAi can be induced in mammalian cells by the use of small interfering RNA also known as 
siRNA siRNA ate shorter segments of dsRNA (typically about 21 to 23 nucleotides in length) that 
result in vivo from cleavage of introduced dsRNA by the action of an endogenous ribonuclease. 
siRNA appear to be the mediators of the RNAi effect in mammals. The most'efifective raRNAs" 

30 appear to be 21 nucleotide dsRNAs with 2 nucleotide 3' oveihangs. The use of siRNA for indudng 
RNAi in mammalian cdls is described by Elbashir, S.M. el al. (2001 ; Nature 41 1 :494-498). 

siRNA can be gyrated indirecdy by introduction of dsRN A mto the targeted cell. 
Alternatively, siRNA can be syndiesized direcdy and introduced mto a cell by transiection methods 
and agents described herdn or known in the ait (sucb as liposome-mediated transfecdon, viral vector 

35 methods, or odier polynucleotide ddivery^ptroduc^ Suitable siRNAs can be selected 
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by examiimig a transcript of flie taiget polynucleotide (e.g.. niRNA) fenucdeotMe seqaences 
dowmtream from the AUG start codon and recoiding the occumaice of «^ 
adjaceot 19 to 23 nucleotides as potential siRNA target sites, wifli sequences having a 21 nucleotide 
lengfli being preferred. Regions to be avoided for target siRNA sites include the 5' and 3' untranslated 
regions (UTRs) and regions near the start codon (within 75 bases), as these may be richer in 
regulatory proton binding sites. UTR-binding proteins and/or translation initiation complexes may 
iiit«ferewilhbindingoffl»siRNPendonucleasecomplex. The selected target sites for siRNA can 
dicaibe cdnpaied to Che appropriate genome database (e.g.. human, etc.) using BLAST or other 
sequencecomparisonalgoriflmislaiownintheart. Target sequences with significant homology to 
ofliercodingsequenoescanbeeliminatedfromconsideration. the selected siRNAs can be produced 
by chemical syBfl«»is mefliods Imown in the art or by m vft« transcription usi^^ 
available methods and kits sue* as fl» SILENCER siRNA construction kit (Ambion, Austin TX). 

In attemalive embodiments, long-term gene silencii,g and/or RNAi effects can be induced in 
seOectedtissueusingexpressionvectorsaatconlinnouslyexpresssiRNA TTiis can be accomplished 
usiAg expression vectors that are e^gine«ed to express hairpin RNAs (shRNAs) u sing methods 
knownin flie art (see. e.g.. Brammeakamp. T.R. et aL (2002) Science 296:550-553^ and Paddison, P.J 
ctaL (2002) G«»s Dev. 16:948-958). In fliese and reUted embodiments, shRNAs can be delivered to 
ta,getcdlsusii««p,essionvectdrsknowninfteart. An example of a suitable expression vector for 
ddiveiyofsiRNAis1hePSILENCER1.0.U6(ci«ndar)ph«mnd(Aiii^ Once ddivered to the 
targpt tissue. shRNAs are processed in vivo into siRNA-like molecules capable of canying out gene- 
specific silencii^ 

In various embodiments, the expression levels of goies targeted by RNAi orPTOS meOtods 
canbedeterminedbyassaysformRNAand/orproteinanalysis. Expression levels of fl»mRN A of a 
targeted gene can be detemlned. for «awii5,le. by northern ani^ 

NORTHERNMAX-GLY kit (Ambiim): by microarray methods : by PGR methods; by real ^ 
methods; andby other RNA/polynucleottdeassaysknowniptheartordescribedhenm Expression 
levels «f the protein encodedby the targeted genecanbedetermined. for example, by micaoartay 
meAods ; by polyacrylamide g<d dectrophoresis ; and by Western analysis using standard techniques 
known in the art. 

An additional embodiment of the invention encompasses a me&od for screening for a 
compound which is effective in altering expression of a polynndeotide encoding KPP. Compounds 
which may be effective in altering expression of a specific polynucleotide may include, but aie not 
limited to. oligonucleotides, antisense oUgonucleotides, triple helix-forming oHgonucleotides. 
transcription foctors and other polypeptide transcriptional regulators, and non-maaomoleculJr 
chemical entities which are capable of interactii^ with specific polynucleotide sequences. Hfective 
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conqpounds may alter polynucleotide expiesdon by actiiig as dfher inliibitors or promoters of 
polynacleotide expressioii. . Thus, in the treatment of disordm associated widi incieased KPP 
expression or activity^ a conqpound which specifically inhibits expression of the polynudeotide 
encoding KPP may be iberapeotically useful, and in I3ie treatment of disorders associated with 

S decreased KPP expression or activity, a conqiound which specifically promotes expression of die 
polynudeottde ^icodiog KPP may be therapeutically us^oL 

In various embodiments, one ormore testconqmunds may be soneened for efiectiveness in 
alteting expression of a specific polynacleotide. A test compound may be obt^ned by any method 
commonly known in the art» including chemical modification of a conqNJund known to be effective in 

10 altering polynucleotide expression; selection ficom an existing, comnoerdally-available or proprietary 
library of naturally-occorcing or non-natural chCTiical compounds; rational design of a compound 
based on cbenucal and/or stmctural properties of the target polynacleotide; and selection from a 
library of chemical compounds created conibinatorially or randomly. A sample comprising a 
polynucleotide encodiog KPP is exposed to at least one test compound Ibus obudned. Ibe sanqile 

IS may coinprise, for example, an intact or permeabilized cell, or an m vitro cell-fiee or reconstituted . 
biochemical systrai. Alterations in the eximssion of a polynucleotide encoding KPP are assayed by 
any method commonly known in die art. TVpcally, the expression of a specific nucleotide is 
detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 
of die polynucleotide encoding KPP. The amount of hybridization may be quantified, thus forming 

20 the basis fox a conqparison of the expression of fiie polynucleotide both with and without exposure to 
one or more test compounds. Detection of a change in the expression of a polynucleotide exposed to 
a test compound indicates that the test compound is effective in altering the expression of the 
polynucleotide. A screen for a compound effective in altering rapression of a specific polynucleotide 
can be carried out, for example, using a Schizosaccharomyces pombe gene expression system 

25 (Atfcins, D. et al. (1999) U.S. Patent No, 5,932,435; Amdt, G.M. et al, (2000) Nucleic Acids Res. 
28:B15) or a hiunan cell line such as HeLa cell (Claike, M.L. et al. (2000) Biochem. Biophys. Res. 
Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligonucleotides) for antisense activity^ against a specific polynucleotide 

30 sequence (Bmice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. 
Patent No. 6,022,691). 

Many methods for introducing vectors into ceUs or tissues are available and equally suitable 
for use in vivo^ in vitro ^ and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into tiiat same patient. 
3S Delivery by transfection, by liposome injections, or by polycatiomc amino polymers may be achieved 
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using floefliods are weQ known in flie ait (Goldman, C.K. el al. (1997) Nat Biotechnol. 15:462- 
466). 

Any of die then^peutic meOiods described above may be applied to any subject in need of 
sudi dimpy, inctaffing, for example, Tnammarc such as bumans, dogs, cats, cows, horses, rabbits, and 
S monkeys. 

An additional embodiment of the invendon relates to the adnmnistraticm of a conqiositi<»i 
which gpnefaUy conoprises an active ingredient fornndated widi a phanmceuticaHy accqitable 
excipient Excipients may include, for exanqile, sugars, starches, ceDnloses, gams, and proteins. 
Various fomadations are commonly known and are thoroughly fiscussed in the latest edition of 

10 Rennngtan's Hiarmaceutical Srienoes fMaack Publishing. Easton PA). Sudi compositions may 
consist of KPP, antibodies to KPP, and nnmetics, agonists, antagonists, or inhibitors of KPP. 

In various «d>odiments, the conq>ositions described heron, such as pharmaceutical 
cornpositions. maybe adnnnisteiedby any number of routes including, but not limited to, oral, 
intravenous, intramnscolar, intra-arterial, intramedullary, intrathecal, intraventricular, pulmonary, 

IS transdermal, subcutaneous, intraperitoneal, intranasal, ent^al, topical, sublingual, or rectal means. 

ConqK>sitions for pulmonary administration may be prepared in liquid or dry powd^ form. 
These compositions are generally aerosolized immediately prior to inhalation by the patient. In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of 
^t-acting formulations is well-known in the art In the case of macromolecules (e.g. larger peptides 

20 and proteins), recent developments in the field of pulmonary deliv^ via the alveolar region of the 
hing have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g.. Fatten, 
J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery allows adnnnistration without needle 
injection, and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include conq>ositions wherein the active 

25 ingredients are contained in an efliective amount to achieve the intended purpose. The determination 
of an effective dose is well within the capability of diose skilled in the art. 

Specialized forms of coiiq>ositions may be prepared for direct intracellular delivery of 
macromolecules comprising KPP or fragments thereof. For «anq)le, liposome preparations 
containing a cdl-i£q>ermBable macromolecule noay promote ceU-fosion and intracdlular deliveiy bf 

30 the macromolecule. Altemativety, KPP or a fragoKnt thereof may be joined to a short cationic N- 
termioal portion firom the HIV Tat-1 protein. Fusion proteins thus generated have been found to 
transduce into the odls of an tissues, including the brain, in a mouse model system (Sdbwaize, S.R. et 
al. (1999) Sdence 285:1569-1572). 

For any conopound, the th^apeuttcaDy effective dose can be estimated initially dtiier in cell 

35 culture assays, e.g., of neoplastic cdls, or in animal models such as ndce, rats, rabbits, dogs. 
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moiike^, or pigs. Anammalmodelimy alsobeiisedtodetenn^ 

raj^e and route of adnnnistratiQiL SuchinfomiatiQncaallimbeiisedtodetmim 

routes for adnniustration ia humans. 

A tbrarapeuticaDy^fective dose refers to tlmtaiix^ forexaoqdeKPP 
5 or fragments tbmof , antibodies of KPP, and agonists, antagomsts or inbibitors of KPP, vHActx 
amdiorates the syo^toms or condition. Therapeutic ^Bcacy and toxicity nsay be det^nmed by 
standard phanmceutical procedures in cdl cidtnres or widi experinorai^ 

calculating the ED50 (the dose therapratically ^fective in 50% of die populatiou) or LD50 (the dose 
ledial to 30% of die population) statistics. The dose ratio of toxic to dierapeudc eifiscts is die 

10 therapeutic index, which can be expressed as flieLDs(/ED5o ratio. Compositions which exhibit large 
therapeutic indices are piefimed. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in sudi compositions is 
preferably widun a range of circulating conc^itrations that includes die ED50 with litfle or no toxidly. 
The dosage varies within this range dq^ending upon the dosage form employed* the sa[isitivity of the 

15 patient, and the route of adnnmstration. 

Iheexactdosage wiUbedet^rnmnedbythepractition^, inlightof fectors rdatedtothe 
subject requiring treatment Dosage and adnnxustration are adjusted to provide suffici^ 1^ 
active moiety or to v^\v(t^m the desired effect. Factors which may be taken into account include the. 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 

20.. subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 

response to therapy. Long-acting conqpositions may be adnanistered every 3 to 4 days, every week,, 
or biweekly depending on the half -life and clearance rate of die particular formulation. 

Normal dosage amounts may vary from about 0.1 //g to 100,000 /ig, up to a total dose of 
about 1 gram, depending upon the route of adndnistration. Guidance as to particular dosages and 

25 methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will en^loy different formulations for nucleotides than for proteins or their 
inhibitors. Sinolarly, delivery of polynucleotides or polypeptides will be specific to particular cdls, 
conditions, locations, etc. 

DIAGNOSTICS - ' " 

30 In another embodiment, antibodies which specifically bind KPP may be used for the 

diagnosis of disorders characterized by expression of KPP, or in assays to momtor patients being 
treated with KPP or agonists, antagonists, or inhibitors of KPP. Antibodies useful for diagnostic 
purposes may be prepared in the same manner as described above for therapeutics. Diagnostic assays 
for KPP include mediods which utilize the antibody and a label to detect KPP in human body fluids 
35 or in extracts of cells or tissues. Theantibodiesmaybeused withor withoiitncxidification, andim^ 



66 



PF.1722P 

be labeled by co valent or non-ooval^ attaduDent of a rqwjter molecule. A wide variety of reporter 
molecides, several of vftich are desonbed above, are knowa in the art and may be used. 

A variety of protocols for measuring KPP, including ELISAs» RIAs, and FACS, are Imown in 
the ait and provide a basis for diagnosing altered or abnormal levels of KPP expression. Normal or 

5 standard values for KPP expression are established by coinbiningbody fluids or cell extracts taken 
from normal mammalian subjects, for example, human subjects, with antibodies to KPP nnder 
conditions suitable for conqflex formation. The amount of standard complex formation may be 
qnantitated by various methods, such as photometric means. Quantities of KPP expressed in subject, 
control, and disease samples fiom biopsied tissues are compared wifli the standard values. Deviation 

10 betwem standard and subject values establishes the paranoetos for diagnosing disease. 

In another endbodiment of the invention, polynucleotides eocodii^ KPP may be used for 
diagnostic purposes. The polynucleotides wfaidi may be used include oligonucleotides, 
complemeotary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect 
. and quantify gene expresdon in biopsied tissues in which expression of KPP may be corrdated with 

IS diseasCi The diagnostic assay may be used to detramine absence, presence, and ^cess expression of 
KPP, and to monitor regulation of KPP levels'during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotides, 
including genomic sequoices, encoding KPP or dosdy related molecules may be used to identify 
nucleic acid sequences which encode KPP. Hie spedficity of the probe, wheOier it is made from a 

20 highly specific region, e,g., the 5* regulatory region, or from a less specific re^on, e.g., a conserved 
motif, and the stringency of the hybridization or amplification will detemune whetixer the probe 
id^itifies only naturally occurring sequences encoding KPP, alldic variants, or related sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the KPP encoding sequences. The hybridization probes of the subject 

25 invention may be DNA or RNA and may be derived fcom the sequence of SEQ ID NO.^ or from 
genonuc sequences including promoters, enhancers, and introns of the KPP gene. 

Means for producing specific hybridization probes for polynucleotides encoding KPP include 
the cloning of polynucleotides encoding KPP or KPP derivatives into vectors for tiie production of 
ndlNA probes. Such vectors are known in the art, are commercially available, aid may be used to 

30 synthesize RNA probes in vitro by means of the addition of the appropriate RNA polymerases and the 
appropriate labeled nucleotides. Hybridization probes may be labeled by a variety of reporter groups, 
for eKanq>le, by radionnclides such as ^P or ^^S, or by enzymatic lab^, such as alkaline phosphatase 
coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotides ^[iGoding KPP may be used for the diagnosis of disord^s associated witii 

35 expression of KPP. Exanq>les of sudi disordm include, but are not linuted to, a cardiovascular 
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disease suA as m^ovemvs fistula, atherosclexosis. hypertension, vascuHtis. Raynaud^, disease, 
aneuiysms, arterial dissections, varicose veins. throiribopUebitis and iMebolhioirbosis. vascular 
tuinors. and coinpHcations of Ihronibolysis. ballooa angioplasty, vascular replacement, and coronary 

arteiy bypass graft surgeiy. congestive heart feilure, ischemic W disease, angina 
5 «^yo«>««al infection, hypertensive W 

valve stenosis, congenitally bicuspid aortic valve, nritral annular calcification, mitral valve prolapse, 
rheamatic fcv« and riieuaiatic heart disease, infective endocarditis, nonbacterial thrombotic 
endocarditis, ei^docarditis of systemic lupus erythematosus, carcinoid heart disease, cardiornyopaflqr. 
n^'ocarditis, pericarditis, neoplastic heart disease, congenital heart disease, and compUcations of 
10 «rfiac transplantation. coi^ta!hiiiganomaUes.atdectasis.puh^ 

pulmonary endwlism. pohnonary hemorrhage, puhnonary infarction, pu^ 

vascular sclerosis, obstructive putanonary disease, restrictive puhnonary disease, chronic obstructive 
putaxaiary disease, emphysenaa. chronic brondiitis. bronchial asfiima. bronchiectasis, bacterial 
pneumonia, viral and nqwoplasmal pneumonia, hag abscess, pidmonary tuberculosis, difliise 
15 intosrtitial diseases. pneurnoconioses,sarc« 

interstitial pneumomtis. iQpperaenativity pneumonia 

obliterans-organizing pneumonia, difiuse puta«inary hemorAage syndromes. Goo^msturei 
«?yiidromes. idiopalhicpuhnonary hemosiderosis, puhnonary invdvement in colhigen-vascu^ 
disorders, puhnonary alveolar protdnosis. hmg tumors, mflammatory and noninflammatory pleural 
20 «^o«is. pneumothorax, ifleural tumors, drug-induced lu^ 

and complications of hmg transplantation; an immune system disorder sudi as acquired 
immunodeficiency syodrome (AIDS). Addison's disease, adult respiratory distress syndrome, 
allergies. ankylosir« spondylitis, anqdoidosis. anonia, asthma, athaosclerosis, autoimmune 
hemolytic anemia, autoimmune thyroiditis. autohnmnnepolyendocrinopaaiorHauididiasis.ectoa 
25 dystrophy (APECED),broncMtis.<aK,lecystitis. contact dermatitis. Crtdm'sdiseas 

demiatomyositis. diabetes meUitus, emplq^cma. c^odic lymphopenia wid> lymphocytotoxins. 
erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis. Goodpasturi>'s 
syndrome, gout. Graves' disease. Hashimoto's thyroiditis, hypereosinoifliilia. irritable bowel 

syndrome, multiple sclerosis, myastiienia gravis, rn rocardial or pericardial-inflammation. 

30 osteoarthritis, osteoporosis, pancreatitis, polynyositis. psoriasis. Reiter's syndrome, rheumatoid 
arthritis, scleroderma. Sjogren's syndrome, systemic anaphylaxis, systemic hipus erythematosus, 
systemic sclerosis, thrombocytopenic purpura, ulcerative coUtis. uveitis. Werner syndn)mB» 
complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, 
parasitic, protozoal, and hehninfliic infections, and trauma; a neurological disorder such as epilq»y. 
35 ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease. Mck's disease. 
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HuntiiigtDifs disease, dementia, Paddi^cm's disease and oflier extrapyramdal disorders, aiiQrotropliic 
latmd sclmisis and ofh^ motor neuron disorders, progressive neural muscular atrophy, ledmtis 
pigmentosa, hieareditary ataxias, multiple scl^osis and ofher dmydinating diseases, bacterial and 
viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
5 tbfttTrihnpblehitis, inyeSitis and radiculitis, viral central ni^ous system disease, prion diseases 
including loiro, Creutzfeldt-Jalsob disease, and Gerstmann-^traussler-Schfiinto syndrome, fatal 
familial insomnia, nutritional and metabolic diseases of the nervous Systran, neurofibromatosis, 
tuberous scl^osis. cmbeOoretinalliBmangioblastomatosis, enoephalotrig^annal syndrome, mental 
retardation and olSisr devdqpmental disorders of die central nervous system including Down 
10 syndrome, cerebral palsy, neoroskeletal disordm, autononnc nervons system disordm, cranial nerve 
disorders, spinal cord diseases, muscular ^stropihy and other neuromascular disorders, peripheral 
niOTOtts system disorders, dermatomyositis and polymyositis, inh^ted, metabolic, endocrine, and 
toxic ixQropatbies, icyasthenia gravis, periodic paralysis, mental disorders including mood, anxiety, 
and sduzopbrenic disorders, seasonal affective disorder (SAD), akatfaesia, amnesia, catatonia, 
IS diabedc neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, 
/ Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 

froDtotenq)oral dracentia; a disorder affecting growth and development such as actinic Iceratosis, 
arteriosclerosis, atheroscl^xisis, bursitis, drrhosis, h^atitis, nixed connective tissue disease 
(MCTD), niyelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 
20 thrombocythenda, renal tubular acidosis, anemia, Cushing*s syndrome, adiondroplastic dwarfism, 
Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR syndrome (Wilms' 
tumor, aniridia, genitourinary abnormalities, and mental retardation), Snuth-Mageius syndrome, 
nqrelodysplastic syndrome, hereditary mucoepithellal dysplasia, hereditary keratod^mas, hereditary 
neuropathies such as Charcot-Mari&-Tooth disease and neurofibrooEiatosis, hypothyroidism, 
25 hydrocephalus, seizuro disordm such as Syndenhantfs chorea and cerebral palsy, spina bifida, 

anencephaly, craniorachisdusis, congenital glaucoma, cataract, and sensorineural hearing loss; a lipid 
disorder such as fatQr liv^, cholestasis, primary biliary drxhosis, carnitine deficiency, carnitine 
palmitoyltransfimise deficimcy, myoadranylate deaminase defidracy, hypertrigl^cmdeaana, lipid 
storage disordm such Fabry's disease, Gauch^'s disease, NienKomrPick's disease; inetachromadc ^ " 
30 leuko(^trophy, adr^ooleukodystrophy, GM2 gai^osidosis, and ceroid lipofbscinosis, 

abetalipoi»oteineima, Tai^er disease, hyperlipoprotdneona, ^betes mdlitus, lipodystrophy, 
lipomatoses, acute panniculitis, dissenmated fat necrosis, adiposis dolorosa, lipoid adrenal 
hyperplasia, tnintnini change disease, lipomas, atherosclerosis, hyperdiolesterol^nia, . 
hypercholest^lCTma with hypertriglycerideotta, primary hypoalphalipoproteioenua, hypothyroidism, 
35 renal disease, liver disease, lecidmKcholesterol acyltransferase deficimcy, cerebrotendinous 
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xanttioxmtosis, sitost^olCTtna, hypoddolesterolmua, Tay-Sacbs disease^ SandhofPs disease, 
hyperlipid^ma, hyperlipemia, lipid inyopafhies, and obesity; and a cdl proliferative disorder such as 
actinic Iceratosis^ arteriosclerosis, ath^t)sclerosis, bursitis, dnhosis, hepatitis, noixed connective 
tissue disease (MCTD), myelofibrosis, paroxysn^ nocturnal hemoglobinuria, polycytheima vera, 

S psoriasis, primaty thrombocyOiemia, and cancers including adenocarcinoma, leukemia, lymphoma, 
melanon^, myeloma, sarcoma, teratocarcinoma, and, in particular, cancels of the adr^ial gland» 
bladder, bone, bone marrow, brain, breast, cervix, colon, gall bladder, ganglia, gastrointestinal tract, 
heart, kidney, liver, lung, muscle, ovaxy, pancreas, parathyroid, penis, prostate, salivary glands, skin, 
spleen, testis, thymus, thyroid, uterus, leukoxsias such as multiple myeloma, and lynqphomas such as 

10 Hodgkin's disease. Polynucleotides ^coding KPP may be used in Southern or noifhem analysis, dot 
blot, or otbrar roenibran&-based technologies; in PCR technologies; in dipstick, pin, and midtiformat 
ELISA-Iike assays; and in nncroarrays utilizing fluids or tissues firompatiCTts to detect altered KPP 
expression. Such qusditative or quantitative methods are weD known in the art 

In a particular embodimBiit, polynucleotides «coding KPP may be used in assays lliat detect 

IS ifaepresCTceof associated disorders, particularly those mentioned above. Polynucleotides 

conqdfflmtaiy to sequences encodiiig KPP may be labeled by standard methods and added to a fluid 
or tissue sample from a patient xmdsr conditions suitable for tiie fornuition of hybridization 
conq>lexes. After a suitable incubation period, the sarqde is washed and the signal is quantified and 
conqpared with a standard value. If the amount of signal in the patient sample is significantly altered"" 

20 in conqparison to a control sanq>le tiien the presence of altered levels of polynucleotides encoding 

KPP in tile sample indicates tiie presence of the associated disorder. Such assays may also be used to 
evaluate the efficacy of a particular th(»speutic treatment reginoi^ in animal studies, in climcal trials, 
or to inonitor tiie treatnoBnt of an individual pati^ 

in ord^ to provide a basis for the diagnosis of a disorder associated with eacpression of KPP, 

25 a normsA or standard profile for expression is established. This may be accomplished by combining 
body fluids or cell extracts taken from normal subjects, either animal or htunan, with a sequence, or a 
fragnn^t thereof, encoding KPP, under conditions suitable for hybridization or amplification. 
Standard hybridization may be quantified by conq>aring the values obtained from normal subjects 
with values from an experiment in which a known amount of a substantially purified polynucleotide 

30 is used. Standard values obtained in this manner may be con^ared with values obtained from 

saiqples from patients who are symptomatic for a disorder. Deviation from standard values is used to 
establish the presence of a disord^. 

Once the presence of a dSisorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to detenmne if tiie level of expression in the 

35 patient begins to approximate tiiat which is observed in the normal subject The results obtained from 



70 



pp.1722 P 



successive assays nmy beused to show fhe Efficacy of treatm^ over a pmod raiigiBg from several 
days to monflis. 

With respect to canc^, the presrace of an abnormal amount of transcript (dfluu' under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition ibr the 

5 development of the disease^ or may provide a means for detecdiig the disease prior to the appearance 
of actual clinical synqitoms. A nioredefiiutive diagnosis of this ^e may aflow health piofe^ 
to CToploy preventative measures or aggressive treatm^ earlier, th»^y preventing die development 
or furdier progression of die canc^. 

Additional diagnostic uses for oligonucleotides designed from fhe sequences ocodiog KPP 

10 may involve the use of PCR. These oligomers may be chemically synthesized, generated 

enzymatically » or produced in yitro. Oligomers win preferably contain a fragment of a polynucleotide 
encoding KPP, or a fragment of a polynudeotide complementafy to die polynudeolide encoding 
KPP, and will be oqployed und^ optindzed conditions for id^itification of a spedfic gene or 
condition. OUgomers inay also be enoplqyed und^ less stringent conditions fixr detecd 

IS quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide prim^ derived frx>m polynucleotides encoding KPP 
may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, insertions 
and deletions that are a frequent cause of inherited or acquired genetic disease in humans. Me&ods 
of SNP detection include, but are not limited to, single-*stranded conformation polymorphism (SSCP) 

20 and fluorescent SSCP (fSSCP) n^thods. In SSCP, oligonucleotide primers d^ived from 

polynucleotides encoding KPP are used to an:q)lify DNA using the polymerase chain reaction (PCR). 
The DNA may be derived, for example, from diseased or normal tissue, biopsy sauG^les, bodily fluids, 
and the like. SNPs in the DNA cause differences in the secondary and tertiary structures of PCR 
products in single-stranded fonn, and these differences are detectable using gel electrophoresis in 

25 non-denaturing gels. In fSCCP, the oligonucleotide primers are fluorescentiy labded, which allows 
detection of the an^limers in high-throughput equipment such as DNA sequencing machines. 
Additionally, sequence database analysis methods, termed in silico SNP (isSNP), are capable of 
identifying polymorphisms by comparing the sequence of individual overlapping DNA fragments 
whidi assCTble into a common consensus sequ^ce. These conoputer-based melhods filt^ out ' 

30 sequence variations due to laboratory 'preparation of DNA and sequencing errors usir^ statistical 
models and autonoated analyses of DNA sequence chromatograms. In the alternative, SNPs may be 
detected and diaracterized by mass spectrometiy using, for exanople, the hi^ throu^put 
MASS ARRAY system (Sequenom, Inc., San Diego CA). 

SNPs may be used to stu<fy tiie genetic basis of human disease. For 6xanq>le, at least 16 

35 cononiDn SNPs have been associated with non-insolin-^q»endent diabetes m^ SNPs are also 



71 



EP-1722P 

useful for exanrining cKfiferaices in disease ontcomBS in monogeiiic disorders, sadi as cystic fibrosis, 
sickle ceH anenna, or chronic gramilomatoiis disease. For e>Lan[q>le, variants in die mannose-binding 
lectin, MBL2 , bave b een shown to be correlated widi dejeterions puhnonaxy outcomes in cystic 
fibrosis. SNPs also have utDi^ in phannaco^somics, the identification of genetic variants that 

5 influence a patient's response to a drug, such as life-flireat^uQg toxicity. For exanople, a variation in 
N-acetyl transferase is associated with a high incidence of peripheral neuropathy in response to the 
anti-tubeiculosis drug isoniazid, while a variation in the core promoter of the ALOXS gene results in 
dinnnished clinical response to treatmrait widi an anti-asthma drug that targ^ die 5-lipoxygenas6 
pathway. Anal^isof the distribution of SNPs in different populations k 

10 graetic drift, mutation, recombination, and selection, as well as for tracing the origins of populations 
and dieir migrations (Ta^or, J.G. dt al. (2001) Trends MoL Med. 7:507-312; Kwok, P.-Y. and Z. Gu 
(1999) Mol. Med. Today 5:538-543; Nowotny, P. et aL QOOl) Curr. Opin. NeurobioL 1 1:637-641). 

Methods which ntKiy also be used to quantify the expression of KPP include radiolabeling or 
biotinylating nucleotides, coaiiq>lification of a control nucleic acid, and interpolating results from 

15 standard curves (Melby, P.C. et aL (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et al. (1993) 
Anal. Biochem. 212229-236). The speed of quantitation of multiple sairqples may be accelerated by 
running the assay in a hi^-throughput format where the oligomer or polynucleotide of inter^t is 
presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 

20 In further embodiments, oligonucleotides or longer fragments derived from any of the 

polynucleotides described h^^in may be used as elements on a microarray. Hie microarray can be 
used in transcript imaging techniques which monitor the relative expression levels of large numbers 
of genes simultaneously as described below. The microarray may also be used to id^tify g^ietic 
variants, mutations, and polymorphisms. This information may be iised to detemcdne gene function, 

25 to understand tiie genetic basis of a disord^, to diagnose a disorder, to monitor 

progression/regression of disease as a function of gene expression, and to develop and monitor the 
activities of th^apeutic agents in the treatm^ of disease. In particular, this information may be used 
to develop a piharmacogmomic proi3e of a patient in ord^ to select the most appropriate and 
eflkctive treatment regmom for tiiat patient For exanqde,fhmpeutic agents wMc^arehiigbiy^^ " 

30 effective and display the fiswest side effects may be selected for a jmtient based onbis/h^ 
pharmacogenonuc profile. 

In another enibodiment, KPP, firagm^its of KPP, or antibodfies specific for KPP may be used 
as elenoents on a ndcroarray. Hie nucroairay may be used to monitor or measure protein-protdn 
interactions, drug-target interactions, and g^ expression profiles, as described above. 

35 A particular embodiment relates to the use of the polynucleotides of the present invention to 
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generate a transcript image of a tissue or ceD type. A transcript image represrats flie global pattern of 
gene expression by a particular tissue or ceD type. Global gene expiession pattems are analyzed by 
quantifying flie number of expressed genes and their relative abundance und» giv« conditions and at 
a given time (Seilhamex et al., "Con9)arative Gcdq Transcript Analysis/' U.S. Patent No, 5,840,484; 
b^eby express^ incorporated by refwence herein). Hius a transcript Image may be genoated by 
hybridizing the polynucleotides of the present invention or their conqjleomits to the totalis of 
transcripts or reverse transcripts of a particular tissue or ceU ^e. In one embodiment, the 
hybridization takes place in high-throughput format, wherein die polynucleotides of the presot 
invention or tiieir complements conqjrise a subset of a phirality of d^mnts on a nncroarray. The 
resultant transcript image would provide a profile of gme activity. 

Transcript images may be generated using transcripts isolated firom tissues, cell lines, 
biopsies, or otiier biological san^les. The transcript image may thus reflect gene expression in vivo, 
as in the case of a tissue or biopsy sample, or in vitro, as in the case of a cell line. 

Transcript images which profile tiie expression of the polynucleotides of the present 
invention may also be used in conjunction witii in vitro model systems and preclinical evaluation of 
pharmaceaticals, as well as toxicological testing of industrial and naturaHy-occurring environmental 
compounds. AH con^Miunds induce characteristic gene expression patterns, irequently termed 
molecular fipgerprints or toxicant signatures, vMdt are indicative of mechanisms of action and 
toxicity (Nuwaysir, RR et al. (1999) Mol. Carcinog. 24:153-139; Steiner, S. and N.L. Anderson 
POOO) Toxicol. Lett 112-113:467-471). If a test conopound has a signature similar to fliat of a 
coiiqwiund with laaown toxicity, it is lifceiy to These fii^gwprints or 

signatures are most useful and refined when they contain expression information ftom a large number 
of genes and gene families. Ideally, a ^ome-wide ruBasurement of expression provides the highest 
quali^ signature. Even genes whose expression is not altered by any tested coxnpounds are irnportant 
as wen, as die levds of expression of these genes are used to normalize the rest of the expression 
data. The normalization procedure is useful for comparison of expression data aftw treatment with 
different compounds. WhSetheassignmi^of gene ftuiction to elements of a toxicant sig^ 
in interpretation of toxicity mechanisms, knowledge of gene function is not necessary for die 

statistical nmtdungofsig|Mtures\?iW<Aleads to prediction of tori^^ 

Release 00-02 firan die National Institute of Environmratal Healtii Sciences, rdeased February 29. 
2000, available at niehs.nih.gov/oc/news/toxchip,hln:d. Therefore, it is hoportaiA and desirable in 
toxicological screening using toxicant signatures to include all expressed g^ sequraces. 

In an embodiment, the toxicity of a test compound can be assessed by treating a biological 
sample containing nucleic acids wifli the test compound. Nucleic adds that ate expressed in die 
treated biological sample are hybridized with one or more probes specific to the polynucleotides of 
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the present inv^ifioii, so that transcript levels conespcmding to the polynucleoddes of die present 
invention may be quantified. Hie transcript levels in the treated Inologtcal sample are compared vnOx 
levels in an nntreated biolo^cal sanqile. Differences in die transmpt levels between the two samples 
are indicative of a toxic response caused by die test conqpound in the treated sanoqple* 

S Another enfljodimsnt relates to the use of die polypqitides disclosed to anaityze the 

proteome of a tissue or cefl type. The term proteome refers to the global pattern of protein expression 
inaparticolar tissue or cell type. Each protdnconqpon^ of a proteome can be subjected 
individually to furdief analysis. Proteome expressum patterns, or profiles, are analyzed by 
quantifyingtheiniinberof esqiressed proteins and diek and 

10 atagiventime. A profile of a ceD*s proteome may dms be generated by sq>arating and analyzing the 
po^pqptides of a particular tissue or cell type. In one embodiment, the separation is achieved using 
two-dimensional gel electrophoresis, in which proteins from a sanq>le are sqiarated by isoelectric 
focusing in die first dimension, and then according to molecular weight by sodium dodecyl sulfate 
slab gel electrophoresis in the second dimension (Steiner and Anderson, supra}. The protdns are 

IS visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an 
agent such as Coomassie Blue or silver or fluorescent stains. The optical d^isity of each protein spot 
is generally proportional to the level of the protein in the sanqfle. The optical densities of 
equivalendy positioned protein spots from diffident samples, for exanqile, from biological san9>les 
either treated or untreated with a test corcqpound or therapeutic agent, are co0^)ared to identify any 

20 changes in protein spot density related to the treatm^t The proteins in the spots are partially 

sequenced using, for example, standard methods employinjg chemical or ^ozymatic cleavage followed 
by mass spectrometry. The identity of the protein in a spot may be determined by conq)aring its 
partial sequence, preferably of at least 5 contiguous anuno acid residues, to the polypeptide sequences 
of interest. In some cases, further sequoce data noay be obtained for defiin 

25 A proteonnc profile may also be g«jaated using antibodies spedfic for KPP to quantil^ die 

levels of KPP expression. In one embodiment, the antibodies are used as dCToents on a microarray, 
and protein expression levels aro quantified by contacting the ndcroarray widi the san^le and 
detecting the levels of protein bound to each array el^nent (Lueking, A. et al. (1999) Anal. BiochemL 
270:103-1 1 1 ; Mendoze, L.G. et al- (1999) Biotechniques 27:778-788). Detection may be i^rformed 

30 by a variety of methods known in die art, for exanqde, by reacting the protdns in die sample with a 
thiol- or amino-reactive fluorescent cornpound and detecting the amount of fluorescence bound at 
each array dement 

Toxicant signatures at the proteome level aro also useftil for toxicological scre^iinig, and 
should be aimlyzed in paraUd with toxicant dgiiatures at the transcript levels There is a poor 
3S correlation b^een transcript and protdn abundarK:es for some proteins in some tissues (Andmon, 
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N.L. and J. Sdlhamer (1997) Electioplioxesis 18:S33-S37)» so piotecxiw toxicant signatures may be 
useiid in llie analysis of conopounds which do not significantly affect the transcript inaage, but ^dncb 
alt^liieproteondc profile. In addition, the analysis of tninscripts in bodyfioids is difiBcu^ 
rapid degradation of mRN A, so proteoomc profiling may be more reliable and informative in mch 
5 cases. 

In another raibodiment* die toxicity of a test compound is assessed by treating a biological 
sanq)Ie containing protons wid&fhe test concpound. Proteins Ibat are expressed in the treated 
biidogicsd sanqfle are sqxarated so that the amount of eadiprot^ Hieamountof 
each protein is cornpared to the amount of the corresponding protein in an untreated biological 

10 san^le. A iSfFmnce in tbe amount of piotein between the two sanqdes is 

response to tiie test connpound in the treated sample. Individual protdns are id^itified by sequenciog 
the ansino acid residues of tbe individual protons and conqiaring these partial sequences to the 
polypeptides of the present invention. 

In another enibodiment, the toxicity of a test conqiound is assessed by treating a biological 

15 sanq>le containing proteins with the test compound. Proteins from the biological samplo are 
incubated with antibodies specific to the polypeptides of the present invoition. Ibe anoount of 
protein recognized by the antibodies is quantified. The amount of protein in the treated biolo^cal 
saiqile is conq)ared with the amount in an untreated biological sample. A difference in ^e amoimt of 
protein between Ae two san^les is indicative of a toxic response to the test compound in the treated 

20 sanq>le. 

Microarrays may be prepared, used, and analyzed using methods known in the art (Brennan, 
T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. USA 
93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251 16; Shalon, D. et al. (1995) 
PCT application WO95/35505; HeHer, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150-2155; 

25 Hener, M.J. et al. (1997) U.S. Patent No. 5,605,662). Various types of nmcroarrays are well known 
and thorou^dy described in Scbena, M., ed. (1999; DNA Microairavs: A Practical Approach . Oxford 
University Press, London). 

In anodier embodim^ of the inv»lion» nuddc acid sequences encoding,KPP may be used 
to generate bybritSzation probes usefid in mappiog tite.natundly occurrite genonnc seguence;^ Hthet 

30 coding or noncoding sequences may be used, and in some instances, noncodiag sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among menibers 
of a midti-gene famity may potentially cause undesired cross hybridization during chromosonoal 
mapping. The sequences may be mapped to a particidar chromosome, to a specific 
chromosome, or to artificial chromosome constructions, e«g., human artificial chromosomes (HACs), 

35 yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
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constructions, or single chromDsonB cDNA libraries ^anington, JJ. et al. (1997) Nat Genet 
15:345-355; Price. CM. (1993) Blood Rev. 7:127-134; Trask. B.J. (1991) Tttajds Genet 7:149-154). 
Once mapped, the nudeic add sequences noay be used to devdop genetic linkage maps, for exan^le, 
wWdi correlate the inheritance of a disease state with the inheritance of a particular chromosome 
5 region or restriction fragment length polymorphism (RFLP) (Lander. E.S. and D. Botstdn (198^ 
PW>c. Natl. Acad. Sci. USA 83:7353-7357). 

Ruorescent in situ hybridization (FISH) may be correlated with other physical and genetic 
map data (Hdnz-Ulrich, et aL (1995) in Meyers, supra, pp. 965-968). Examples of genetic map data 
can be found in various sdentific journals or at the Online Mearfdian Inheritance in Man (OMIM) 
10 World Wide Web site. Corrdation between the location of the gene encoding KPP on a physical map 
and a specific disoider, or a predisposition to a spedfic disorder, may hdp define the region of DNA 
assodated wifli Oiat disordra- and dins may farther positional cloning efforts. 

In situ hybridization of chromosomal prq>arations and physical mapping techniques, sudi as 
linkage analysis using established duomosomal markers, may be used for extending genetic maps. 
15 Often flieplacanMit of a gei» on the dttomosomB of another manomaUan species, su^^ 

naay reveal assodated markers even if flie exact chromosomal locus is not known. This information 
is vahiable to investigators seaidiing for disease g^ using positional doning or other gene 
discovery tediniques,^Once Bie gene or gpnes responsible for a disease or syndrome have been 
cmddy localized by genetic liiikage to a particular genomic region. e,g., ataxia-tdangiectasia to 
1 1^-23, any sequences mapfai^ to that area may xeptesaO. assodated or regulatory gcaes for 
farther investigation (Gatti. R.A. etal. (1988) Nature 336:577-580). The nudeotide sequence of the 
instant invention may also be used to deted diffisrences in the duomosomal locate 
translocation, invasion, etc., among normal, catria, or affected individuals. 

In another enfl^odiment of the invaition, KPP. its catalytic or immunograiic fragments, or 
digopeptides fhoeof can be used for screcaiing libraries of conq>oui^ in any of a vari^ of drug 
screening tedmiques. ThBtogmenten?floyBdinsud»screaaiugmaybefteeinsohilion.afBxedtoa 
soUd support, borne on a cdl snifoce, or located mtraceflularly. The formation of binding complexes 
b^een KPP and the ag^ being tested may be uKasured. 

Anoth» tedmique for drug screening.provides for hi^ flirou^t screemnff of com^^ 
30 having suitable buidmg affinity to the protein of interest (Geysen. et al. (1984) PCT application 

WO84/03564). In this method, large naniyets of different small test con^oands are synthesized on a 
solid substrate. The test compounds are reacted with KPP, or fragments fliereof, and washed. Bound 
KPP is then detected by methods well known in the art. Purified KPP can also be coated directly onto 
plates for use in the aforementioned drug screening tedmiques. Alternatively, non-neutralizing 
antibodies can be used to capfare the p^Ude and imnaobilize it on a solid support 
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In anoflier embodunent, one may use conqjetitive drag screening assays in which neutralizmg 
antibodies capable of binding KPP specifically co^^>ete with a test conqwiund for binding KPP, In 
this mann^, antibodies can be used to detect tb& presence of any peptide which shares one or more 
antigenic determinants with KPP. 
5 In additional enDtbodim^, the nucleotide sequences vAnck encode KPP may be used in any 

molecular hiology techniques that have yet to be developed, provided flie new techniques rely on 
properties of nucleotide sequoices that are cuirenfly known, including, but not limited to, such 
properties as die triplet genetic code and specific base pair interactions. 

Wifliout further elaboration, it is believed fliat one skilled in the art can, using fee preceding 
10 description, utilize flie present invention to its fuflest extent The following embodiments are, 

therefore, to be construed as merely illustrative, and not Umitati ve of the remainder of the disclosure 
in any way whatsoever. 

The disclosures of all patents, applications and publications, mentioned above and below, are 
e^ressly incorporated by reference herein. 

15 

EXAMPLES 

!• Construction of cVNA Libraries 

Incyte cDNAs are derived fixm cDNA libraries described in the LIFESEQ database (Incyte, 
Palo Alto CA) and shown in Table 4, column 3, Some tissues are homogenized and lysed in 

20 guanidinium isofliiocyanate, while others are honngenized and lysed in phenol or in a suitable 

nmxture of denaturants, such as TREOL (Invitrogen), a monophasic sohition of phenol and guamdine 
isothiocyanate. The resulting lysates are centrifuged over CsQ cushions or extracted with 
chloroform. RNA is precipitated from the lysates with dfbcr isopropanol or sodium acetate and 
ethanol, or by other routine methods. 

25 Phenol extraction and precipitation of RNA are zepeatcd as necessaiy to increase RNA purity. 

In some cases, RNA is treated with DNase. For n»st libraries, poly(A)+ RNA is isolated usmg oligo 
d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QL\GEN, ChatsworOi 
C A), or an OUGOTEX niRNA purification kit (QIAGEN). Alternatively, RNA is isolated direcfly 
from tissue lysates using other RNA isolationJdts, e.g., the POLY(A)PURE niRNA purification kit — 

30 (Ambion, Austin TX). 

In some cases, Stratagene is provided with RNA and constructs the corresponding cDNA 
libraries. Otherwise, cDNA is synthesized and cDNA libraries are constructed wifli the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Invitrogen), using the recommended 
procedures or simflar methods known in the art (Ausubel et al., supra, ch. 5). Reverse transcription is 
35 initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters are ligated to double 
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straoidedcDNA, and die cDNA is digested with Oeappro^ for 
nxist Hbraiies, llie cDNA is size-sdected (300-1000 bp) usk^ SEPHACRYL SIOOO, SEPHAROSE 
CL2B, or SEPHAROSE CL4B cohmni chroaatography (Amecsham Biosdraces) or prq>arative 
agarose gd electrophojresis. cDNAs axe Cg^ted into compatible lestriction eazym& sites of the 
5 polyliid;^ of a suitable plasnnd, e.g., PBLUESCRIPT plasond (Stratageoe). PSPORTl plasnnd 

(Iixvitroge3i» Carlsbad CA), PCDNA2.1 plasnud Obmtrogqn), PBK-CMV plasmid (Stratagene), PCR2- 
TOPOTA plasmid Qimtrosm), PCMV-ICIS plasmd (Stratag^), pIOEN (bicyte, Palo AItt> CA), 
pRARB (iDcyte), or pINCY Qncyte), or derivatives dieieof. Recominnaiit plasimds are transformed 
. into competent R coU odis iochiding XLl-Bfoe, XLl-BlueMRP, or SOLR fitom Stratagene or DH5a, 
10 DHIOB, or ElectToMAX DHIOB fiomlhvitrogra. 
n* Isolation of cDNA Clones 

Plasmids obtained as described in Exan^le I are recovered firomhost cells by in vivo exdsion 
using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids are purified using at least 
one of the foUovving: a Ma^c or WI21ARD Minipr^s DNA purification system (Promega); an AGTC 
15 Minipiep purification kit (Edge Biosystems, Gaifhersburg MD); and QIAWELL 8 Plasnfid, 

QIAWELL 8 Plus Plasnid, QIAWELL 8 Ultra Plasmid purification systems or the R.E. AL. PREP 96 
. plasndd purification kit from QIAGEN. Following precipitation, plasnnds are resuspended in 0.1 ml 
of distDled water and stored, with or without lyophiliza1ion« at 4''C. 

Altmiatively, plasmid DNA is anq)lified from host cell lysates using direct liok PGR in a 
20 higji-througbput format (Rao, V.B. (1994) AnaL Biocbem. 216:1-14). Host cell lysis and thermal 
cycling steps are carried out in a single reaction nnxture. Samples are processed and stored in 384- 
well plates, and the concentration of an^lified plasmid DNA is quantified fiuorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN n fluorescence scanner 
(Labsystems Oy, Helsinki, Finland). 
25 III* Sequencing and Analysis , 

Incyte cDNA lecovered in plasmids as described in Example n are sequenced as follows. 
Sequencing reactions are processed using standard mefliods or high-throughput instnunentation such 
as &e ABI CATALYST 800 (Applied Biosystenos) ttiennal cycler or the PTC-200 thermal cycler (MJ 
Research) in conjunction y/idi the HYDRA microdispens^ (Robbins Scientific) or the MICROLAeT 
30 2200 OHamilton) liquid transfer system* cDNA sequencing reactions are prepared using reagents 
piovided by Amerslham Biosciences or supplied in ABI sequendng kits such as die ABI PRISM 
BIGDYE Tmmnator cycle sequmcing ready reaction kit (Applied Biosystems). Electrophoredc 
separation of cDNA sequencing reactions and detection of labeled polynucleotides are carried out 
usiD^ the MEGABACE 1000 DNA sequencing system (Amersham Biosciences); Oie ABI PRISM 373 
35 or 377 sequencing system (Allied Biosystems) in conjunction with standard ABI protocols and base 
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cdlioe s^'^^^*'^' ^'^^ ^^Quci^ SUKQ^^ ReadiogfiraooBsmttuiifhe 
cDNA seqaences are ideaitified using standard xnetliDds (Ansobd et al., supra^ ch. 7). Some of flie 
cDN A seqaences are selected fas CTtendon nsipg the teduuqaes disclosed in Bxan^le VIIL 

. Polyniicleotide seqaences derived fonilnc^cDNAs aievalidatedbyi^noving vector, 
5 linker, and poly(A) sequences and by maglfing anibigaous bases, using algpriOuns and programs 
based on BLAST, dyoanac programming, and dinacleotide nearest n^gbbor analysis. The Incyte 
cDNA seqaences or translations thereof are dien queried against a selectkm of public databases such 
as tibe GenBank primate, rodeot, mammalian, v^tebrate, and eukaryote databases, and BLOCKS, 
PRINTS, DOMO, PRODOM; PROTEOME databases ^lli sequfflces bomHomo sapiens^ Rattus 
10 notvegioiSy Mus musculus^ Caenorhabditis elegans, Saccharomyces cerevisiae, Schtzosaccharomyces 
pombe» and Candida albicans QnicytB, Palo Alto CA); bidden Markov model (HMMQ-based protdn 
fsarSty databases such as PPAM, INCY, and TIGRFAM Olait, D.H. et al. (2001) NucMc Adds Res. 
29:41-43); and HMM-based protdn domain databases sudi as SMART (Schultz, J. et al. (1998) Proc. 
NatL Acad. ScL USA 95:5837-5864; Letunic, L et al, (2002) Nucldc Adds Res. 30:242-244). 
15 (HMM is a probabilistic appioacb wbicb analyzes consensus primary stmctures of gene families; see, 
fi>r example, Eddy, S.R. (1996) Cunr. Opin. Stmct BioL 6:361-365.) The queries are performed 
uang programs based on BLAST, FASTA, BLIMPS, and HMMER. The Incyte cDNA sequCTces are 
assennbled to produce full lengib polynucleotide sequences. Alternatively, GenBank cDNAs, 
GenBank ESTs, stitched sequences, stretched sequences, or Genscan-predicted coding sequences (see 
20 Exanq)les IV and V) are used to extend Incyte cDNAassenAlages to full length. Asseniblyis 

performed using programs based on Phred, Phrap, and Consed, and cDNA assemblages are scrcCTed 
fox open reading frames using programs based on OeneMark, BLAST, and PASTA Ibe full length 
polynucleotide sequences are translated to derive the corresponding full length polyp^tide 
sequences. Alternatively, a polypeptide may be^ at any of die methionine residues of the full lenglli 
« 25 translated polyp^tide. FuU length polypeptide sequences are subsequently analyzed by querying 
against databases such as the GenBank protein databases (genpept), SwissProt, the PROTEOME 
databases, BLOCKS, PRINTS, DOMO, PRODOM, Piosite, hidden Markov model (HMM)-based 
protein family databases such as PFAM, INCY, and TIGRFAM; and HMM-based prot^ domain 
databases sudi as SMART. FoH length polynudeotidesequCTces are also andyzedi^ 
30 MACDNASIS PRO software (MuraiBio, Alameda CA) and LASERGENE software (DNASTAR). 
Polynucleotide and polypeptide sequence alignmmts aie generated usipg default parameters specified 
by the CLUSTAL algorithm as incorporated into Oie MEGAUGN miiltisequence aUgnment program 
^NASTAR), vliidh also calculates the percent identity between aligned sequences. 

Table 7 summarizes tools, programs, and algorithms used for the analysis and assembly of 
35 Incyte cDNA and fidl lengSi sequences and provides applicable descriptions, references, and 
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threshold parametBis. The fiist cohmm of Table 7 shows the tools, programs, and algoiilhms used, 
the second cohmm provides brief descriptions fliereof, the third cohamn presents appropriate 
refiarences. all of whidi are incorporated by refwence herdn in their ratirety, and the fourth cohmm 
piesenls, vAieie applicable, flie scores, probability vahies, and other parameters used to evahiate fbo 
5 strength of a match between two sequences (the iug^a (ba score or the lower die probability value, 
Qie greater the identity between two sequences). 

The programs d^cribed above for the assembly and analysis of full leiigtb polynucleotide 
and po^ptide sequences are also used to idraitify polynucleotide sequraice fragnents from SEQ n> 
NO:2. FragnoMJts from about 20 to about 4000 nucleotides whidi are useful in Itybridization and 
10 amplification tedmplogies are described in Table 4, column 2. 

IV. Idttitification and Editing of Coding Sequences from Genomic DNA 

Putative Idnases and phosphatases are initially identified by running the Genscan gene 
identification program against public genonmc sequraice databases (e.g., gbpri and gbhtg). Gaiscan is 
a ^neral-pnipose ^ne identification program yrldcb. analyzes genome DNA sequences from a 
IS variety of oiganisms (Burge. C. and S. Kailin (1997) J. MoL BioL 268:78-94; Burge, C. and S. Karlin 
(1998) Curr. Opin. Struct Biol. 8:346-354). Hie program concatenates predicted exons to form an 
assrad)ledcDNA sequence fflOending from a mettuonine to a stop codim. Thcoutputof Genscan is a 
PASTA database of po^nmdeotide and polypeptide sequraices. Themaximunrange of sequence for 
Goiscan to analyze, at once is set to 30 Itb. To detonme which of these Genscan prefficted cDNA 
20 sequences encode Idnases and phos^iatases, the encoded poty^ 

against PFAM models for Idnases and phosphatases. Potential kinases and phosiAatases are also 
identified by homology to Incyte cDNA sequences that have becm annotated as Idnases and 
phosphatases. Hiese selected Gemscan-predicted sequences are then con^arcd by BLAST analysis to 
flie gmpept and gbpri public databases. Whoe necessary, die Gcaisean-predicted sequences are tiien 
25 editedbycon5)arisontolhetopBlJ^Thitftomgqnpq?ttocoim:tern>rsinthBseqnence^ 

byGMiscan.sudias«traoronrBttedcaons. BLAST analysis is also used to find any bicyte cDNA or 
pubUc cDNA cov«age of the Genscan-predicted sequences, tins providing evidence fi>r transcription. 
When Incyte cDNA coverage is available, tiiis information is used to correct or confirm tiie Genscan 

predicted sequence. lei^ polynucleotide sequences are obtauKsd by assonbliiie Genscan--^ 

predicted coding sequences witii Incyte cDNA sequences and/or public cDNA sequences using die 
assembly process described in Exaniple m. Altematively, full lengtii polynucleotide sequences are 
derived entirely from edited or unedited Genscan-predicted coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 

"Stitched" Seq iignpi.g 

Partial cDNA sequeaices are extended wifli exons predicted by die Genscan geaie 
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identification program described in EKanqfle IV. Paitial cDNAs assembled as described in B&anQ>le 
m ate mapped to g«oiiac DNA and parsed into clustm contaimng related cDNAs and Geoscan exon 
predictions firom one or noore^Domic sequences. Eacb cluster is analyzed using an algoritibmbased 
on gcaph theory and d^naiac progranoimiig to integrate cDN A and genoimc infonmtion, g^omting 
5 possible splice variants tbat are subsequendy confirmed, edited, or extended to <^eate a full lei^glh 
seqpence. Sequence integrals in ^vbichlbe entire lengQi of die inte^ 

sequence in the cluster are identified, and int^vals thus idmtified are considered to be equivalent by 
transitivity. For exanqple, if an interval is present on a cDNA and two genonmc sequences, tl^ 
three int^als are considered to be equivalent Ibis process allows unrelated but consecutive 

10 genonnc sequences to be brought together, bridged by cDNA sequence. Intravals thus identified are 
then "stitched" together by the stitching algorhhmin the order that they appear along thrir parmt 
sequences to generate the loiigest possible sequoice, as well as sequrace variants. Linkages betweesa 
intervals which proceed alopg one type of parent sequence (cDNA to cDNA or genomic sequ^oce to 
genomic sequence) are given preference over linkages which change parent type (cDN A to gmondc 

15 sequence), llie resultant stitdied sequences are translated aMconq)aied by BLAST aiialys^ 
genpept and gbpri public databases. Incorrect exons predicted by Genscan are corrected by 
conq>arison to the top BLAST hit from g^npqpL Sequ»K:es are further extended vnth additional 
cDNA sequences, or by inspection of gmonmc DNA, when necessary. 
**Stretchcd^^ Sequences 

20 Partial DNA sequences are extended to full length with an algorithm based on BLAST 

analysis. First, partial cDNAs assembled as described in Exanq>le m are qu^ed against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases 
using the BLAST program. The nearest GenBank protein homolog is then con^ared by BLAST 
analysis to dither Incyte cDNA sequences or GenScan exon predicted sequences d^cribed in 

25 ExanqdelV. AcAdmericprotehiisgenmtedbyusingtheresidtanthi^-scoringsegn^ 

CHSPs) to map the translated sequraces onto the GenBank protdn homolog. Insertions or deletions 
may occur in die chimeric protem with respect to the original GenBank proton homolog. Ibe 
GenBank protein homolog, die chimeric protdn, or both are used as probes to searcli for homologous 
genomic sequences from the public human gCTome databases.. Partial-DNA sequences are therefore 

30~ "stretdtied** or extraded by the addition of homologous genomic sequw The resultant stretched 
sequences are examined to detemnne whether they contain a con^lete gme. 
VI. Chromosomal Mapping of KPP Encoding Polynucleotides 

Ihe sequences nsed to assenible SEQ ID NO:2 are compared with sequences fiom flie Incyte 
LIFESBQ database and public domain databases using BLAST and other implementations of the 

35 Smith" Waterman algorithm. Sequences from these databases diat matched SEQ ID NO:2 are 
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assenfliled into dostieis of condguous and ovedqpping sequences using assembly algoriflmis such as 
Fhrap (Table 7). Radiation hybrid and genetic mapping data available ftom public resources such as 
the Stanford Human G^me Center (SHGC)» Whitdiead Institute for Genome Research (WIGR), 
and G6i6thon are used to detemmie if any of the clustered sequences have been previously mapped. 

S Inclusion of a mapped sequence in a cluster results in Ihe assignment of all sequences of that cluster, 
including its particular SEQ ID NO:» to fliat map location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an interval, in ceniiMorgans, is measured relative to ttie temnnus of die chromosome's p- 
aim. (The centiMorgan (cM) is a unit of measurement based on reconibination finequencies between 

10 duomosomal markers. On average, 1 cM is rougUy equivalent to 1 megabase (Mb) of DNA in 
humans, although diis can vaiy widely due to hot and cold spots of recombination.) The cM 
distances are based on genetic madcers mapped by G6n£tiion which provide boundaries for radiation 
( hybrid maikers whose sequences were included in each of tire clusters. Hunxm genome maps and 
otiier resources avmlable to the public, such as the NCBI ''GeiueMap*99" World Wide Web site 

IS (ncbi.nlm.nih.gov/genem2{>/)» can be erqployed to determine if previously identified disease genes 
* rnap within or in proxirnity to tile intervals indicated above. v 
VII. Analysis of Polynucleotide Eiqpre^ion 

Northern analysis ts a laboratory technique used to d^ect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 

20 from a particular cell Qrpe or tissue have been bound (Sarnbrook and Russell, supra^ dx. 7; Ansubel et 
aL, suprUy ch. 4). 

Analogous computer techniques applying BLAST are used to search for identical or related 
molecules in databases sudi as G^iBank or LDPESEQ (Incyte). This analysis is much fast^ than 
multiple membrane-based hybridizations. In addition, the sensitivity of die coiiq>uter search can be 
25 modified to detemune whether any particular match is categorized as exact or similar. The basis of 
the seardi is the product score, which is defibaed as: 

BLAST Score x Percent Identity 

5 X fiiininium {length(Seq. 1), lengtii(Seq; 2)) 

30 

The product score takes into account botii Ihe degree of sindlarity between two sequences and the 
leogtii of die sequence match. The product score is a normalized value betwemO and 100, and is 
calculated as follows: the BLAST score is rxadtiplied by the percent nudeotide identity and the 
product is divided by (5 times the l^igth of the shorter of Ihe two sequences). Ihe BLAST score is 
35 calculated by assigmng a score of +S for ev^ base that matches in a high-scorii^ segment pair 
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(HSIOf aiid-4fbreveiyndsniatc]i. Two seqamces may share iddjs than one HS^ 
gaps). If thm is more than one HSP, then tbe pair vnSi the highest BLAST score is used to calculate 
the product score. The product score represeots a balance between fractional overlap and quaUty in a 
BLAST alignmrait For exanq>le, a product score of 100 is produced on^ for 100% ideaHity over the 

5 ^itirelraglhof the shoites* of the two sequ&Qces being ocm^ A product score of 70 is produced 
edflierby 100% idaitily and 70% ovedap at one end, or by 88% identity and 100% overlap at the 
odier. A product score of 50 is produced dlthtf by 100% id»tity and 50% ovearlap at one end, or 79% 
identity and 100% ov^lap. 

Altmuitively, polyoudeotides encoding KPP are analyzed m 

10 from wbich they are dedved. Forexan^ile, some fuBlragthsequemres are assenfliled, at least in p 
with overlapping Incyte cDNA sequences (see Example m). Each cDNA sequCT^ce is derived firom a 
cDNA library constnicted from a liunian tissue. , Each bunian tissue is clasdfied into one of the 
fiDllo^xuig organ/tissue categories: cardiovascular systejn; connective tissue; digestive systen^ 
eoibryomc structures; endocrine system; exocrine glands; genitalia, female; genitalia, male; germ 

IS cells; hennc and immune systmi; liver; musculoskeletal systmi; nervous system; pancreas; 

respiratory syst^ sense organs; skin; stomatognathic system; unclassified/mixed; or urinary tract 
The number of libraries in each category is counted and divided by the total number of libraries 
across all categories. Sinularly, eadi human tissue is classified into one of the following 
disease/condition categories: cancer, cell line, developmental, inflammation, neurological, tramm, 

20 cardiovascular, pooled, and other, and the nunoiber of libraries in each category is counted and divided 
by the total number of libraries across all categories. The resulting percentages reflect the tissue- and 
disease-specific expression of cDNA encoding KPP. cDNA sequences and cDNA library/tissue 
information are found in the LIFESEQ database (Incyte, Palo Alto CA). 
VIIL Extension of KPP Encoding Polynucleotides 

25 Full length polynucleotides are produced by extension of an appropriate fragnient of the full 

length molecule using oligonucleotide primers designed firom this fragment One primer is 
synttiesized to initiate 5* extension of the known fragment, and the other prinoer is synthesized to 
initiate 3* at^osion of the known firagment The initial primers are designed using OLIGO 4.06 
software Rational Bioscimces), or anoOiGr appropriate program, to be about 22 to 30 mfcleotidei iST 

30 length, to ba ve a GC content of about 50% or more, and to anneal to tbe targ^ sequrace at 

tenqperatures of about 68 ^^C to about 72 Any stretdi of nucleotides which would result in hairpin 
stroctures and primer-primer dimerizations is avoided. 

Sdected human cDNA libraries are used to extoid the sequence. If moreOianoneat^ion 
is necessary or desired, additional or nested sets of primes are designed. 

35 HighfideUty anqilificationisobtamedbyPCRusxognaBthods weUknowninAe^ PGR is 
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perfbnned in 96- weU plates ii^i^g the FTC-200t[h^ Th&ieacticminbi 
contains DNA teioplate^ 200 nmol of eacb prim^, leactioabufte containing Mg^% (NH4)2S04, and 2* 
noBrcaptoetlianol, Taq DNA poIymiKase (Ameisham Biosciraices), ELONGASB fflzyme Gixvitrogen), 
and Pfii DNA polymerase (Stratag^X ^ fbllo\idng parameters for primer pair PCI A and PCI 

5 B: St^ 1 : 94*C, 3 min; Step 2: 94**C, 15 sec; Step 3: 60"C. 1 min; Step 4: 68X, 2 min; Stqp 5: Steps 
2» 3, and 4 repeated 20 times; Step 6: 68**^ 5 min; St^ 7: storage at 4*'C. In the alternative, the 
param^ers for primer pair T7 and SKf are as follows: Stq> 1: 94''C» 3 mii^ Step 2: 94''C, 15 sec; 
St^ 3: ST'C^ 1 nmn; Step 4: 68"*^ 2 min; Step 5: Steps 2, 3, and 4 r^»eated 20 times; Stq> 6: 68**^ 5 
nun; Stqp 7: storage at 4*'C. 

10 The concentration of DNA in each well is detennined by dispensing 1 00 nl PICOGREEN 

quantitation reagent (0.25% (vAr) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and O.S /il of undiluted PCR product into each well of an opaque fluorimeter plate (Coming Costar, 
Acton MA), allowing the DNA to bind to the reagent Hie plate is scanned in a Huoroslcan II 
(Labsystens Oy, Helsinki, Finland) to measure the fluorescraice of ttie san^le and to quantify the 

15 concentration of DNA. A S /Lil to 10 a^I aliquot of the reaction mixture is analyzed by ^ectrophoresis 
on a 1 % agarose gel to detemsne which reactions are successful in extending the sequence. 

The ext^ided nucleotides are desalted and concentrated, transferred to 384-we!l plates, 
digested with CvUI cholera virus ^idonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Biosciences). For shotgun 

20 sequencing, the digested nucleotides are separated on low concentration (0.6 to 0.8%) agarose gels, 
fragments are excised, and agar digested with Agar ACE (Promega). Extended clones are religated 
using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham Biosciences), 
treated widi Pfu DNA polymi^ase (Stratagene) to fiU^in restriction site ov^angs, and transfected 
into conqpetent R coll cells. Transformed ceils are selected on antibiotic-containing media, and 

25 individual colonies axe picted and cultured ovmiight at 37 ''C in 384-wd] plates in LB/2x caib liquid 
media. 

TbeodOs arelysed, andDNAis an^Medby PCRusi]:%TaqDNApo]yna»:ase(AnQ^&diam 
Biosdences) and Pfu DNA polymerase (Stratagene) with the fbOowii^ parameters: Step 1: 94''C, 3 
nmn; Step 2: 94^C. 15 sec; Step 3: 60**C, 1 min; Stqp 4: 72''C, 2 min; Step 5: steps 2, 3» and 4^ ^ 

30 repeated 29 times; Step 6: 72''C, 5 min; Step 7: storage at 4*'C. DNA is quantified by PICOGREEN 
reag^ CMolecular Probes) as described above. Samples with low DNA recoveiies are reanqplified 
using the same conations as described above. Samples are diluted with 20% dimediysulfbxide (1:2, 
v/v), and sequenced using DYENAMIC en^gy transfer sequencing primers and the DYENAMIC 
DIRECT kit (Amersham Biosciences) or the ABI PRISM BIGDYE Terminator cycle seqooacintg 

35 ready reaction kit (Applied Biosyst^ns). 
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In like nmnner, full length polymideolides are verified using the above procedure or are used 
to obtain 5' regulatory sequences using the above procedure along with oligonucleotides designed ft*r 
such extension, and an appropriate g^iomic library. 

IX. Identification of Single Nucleotide Ftolymorphisnis In KPP Encoding Polynucleotides 
5 Common DNA sequence variants known as single nucleotide polymorphisms (SNP&) are 

identified in SEQ ID NO:2 using the LIFESEQ database (Incyte). Sequences fix)m the same gene are 
clustered together and assenabled as described in Example m. allowing die identification of all 
sequence variants in the gene. An algoridmi consisting of a series of filters is used to distinguish 
SNPs fiom other sequence variants. Preliminary filters remove the noajority of basecaU errors by 

10 requiring a minifmim Phred quality score of 15, and remove sequence alignment errors and errors 
resulting from improper trimming of vector sequences, chimeras, and splice variants. An automated 
procedure of advanced cbromosome analysis is applied to the original chromatogram files in the 
vidnily of die putative SNP. Clone error filters use statistically generated algorithms to identify 
errors introduced during laboratory processmg, such as fliose caused by reverse transcriptase, 

15 polymerase, or somatic mutation. Clustering error filters use statistically generated algoriflmis to 
identify errors resulting firom clustering of close bomologs or pseudogenes, or due to contamination 
by non-human sequences. A final set of filters removes duplicates and SNPs found in 
immunoglobulins or T-cell receptors. 

Certain SNPs are selected jfor further characterization by mass spectrometry using the high 

20 fliroughput MASSARRAY qrstem (Sequenom. Inc.) to analyze Oldie fi^equencies at the SNP sites in 
four diHermt human populations. The Caucadan population compnses 92 individuals (46 male, 46 
female), including 83 firom Utah, four Frendi, fliree Venezualan, and two Amish individuals. The 
African population comprises 194 individuals (97 male, 97 female), all African Americans. Tie 
Hispanic population con^rises 324 individuals (162 male, 162 female), all Mexican Ifispanic. Hie 

25 Asian population comprises 126 individuals (64 male, 62 female) wifli a reported parental breakdovm 
of 43% Chinese, 31% Japanese, 13% Korean, 5% Vietnamese, and 8% other Asian. ADele 
ftequencies are first analyzed in die Caucasian population; in some cases diose SNPs which show no 
aUelic variance in this population are not furdier tested in die oflier fliree populations. 

Labeling and Use of IndividualHybridlzatlon Probes *' ' 

30 Hybridization probes derived firom SBQ ID NO:2 are eoqiloyed to screen cDNAs, gmonic 

DNAs, or niRNAs. Altiiough die labeling of oligonucleotides, consistmg of about 20 base pairs, is 
specifically described, esseotially die same procedure is used wifli larger nucleotide fragments. 
Oligonucleotides are designed using stat^-of-ttie-art software such as OLIGO 4.06 software (National 
Biosciences) and labeled by conabintqg 50 pmol of eac^i oligomer, 250 iiCi of [y-^^] adenosine 

35 triphosphate (AmM-sham Biosd«aices), and T4 polynucleotide kinase (DuPont NEN, Boston MA). 
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The labeled oligonucleotides are sabstandally purified tish^ a SEPHADEX G-2S suprafine size 
eKclusion dextnmbead cdoim (AmexshamBioscinces). An aliquot contaiTiing 10'' counts per 
muoute of ttie labeled probe is used in a ^ical inemhraiie-based hybridization analysis of human 
gemmnc DNA digested vnA one of the fbllowiiig endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, 

5 or Pvn O'CDuPont NEN). ^ 
The DNA from eacii digest is fractionated on a 0.7% agarose gd and transfened to NYTRAN 
PLUS o^on membranes (Sddncher & ScbueD, Duriiam NH). H^ridization is carried out for 16 
hours at AO^C. To remote nonspecific signals, blots are sequentially washed at room tm9)mtnre 
undfflT coudifioi^ of xqp to» for exBsnplG^ 0. 1 x saline sodium titrate and 0.5% sodKum dodecyl sulfate. 

10 Hybridization patterns are visualized usiag autoradiography or an alternative imaging means and 
conpared. 

XI. MIcroarrays 

The linkage or syn&esis of array elements upon a mtcroarray can be achieved utili:rixig 
photolifhography^ piezoelectric printing (inkrjet printing; see, e.g., Baldeschweiler et al., supraX 

IS mechanical nucrospotting technologies, and derivatives thereof. The substrate in each of the 

aforementioned technologies should be uniform and solid with a non-porous surface (Schena, M., ed. 
(1999) DNA^ficroanavs: A Practical Approach , Oxford University Press, London). Suggested 
substrates include silicon, silica, glass slides, ^ass chips, and silicon wafers. Alternatively, a 
procedure analogous to a dot or slot blot may also be used to arrange and link elements to the surface 

20 of a substrate using thermal, UV, chenucal, or mechanical bonding procedures. A typical array may 
be produced using available methods and machines well known to tiiose of ordioaiy skill in the art 
and may contain any appropriate nurob^ of elements (Schena, M. et al. (1995) Science 270:467-470; 
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, Au and I. Hodgson (1998) Nat 
Biotechnol. 16:27-31). 

25 FuU length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 

conqirise the dements of the nmcmarray. Fragments or oligomers suitable for hybridization can be 
selected using software weU known in the art such as LASEStGENB software (DNASTAR). The 
array dements are hybridized ^th polynucleotides in a biological sample. The polynucleotides in 
the biological sample are coigugated to a fluorescrat label or oth^ molecular tag for ease of " 

30 detection. After hybridization, nonhybridtzed nucleotides from the biological sample are removed, 
and a fluorescence scanner is used to detect hybri(£zation at each array dement Alternatively, las^ 
desorbtion and mass spectrometry may be used for detection of hybridizatioxL The degree of 
con^lementarity and the rdative abtuidance of each polynudeotide \^4uch hybridizes to an dement 
on the microarray may be assessed. In one aabodiment, nucroarray preparation and usage is 

35 described in detail bdow. 
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Tissue or Cell Sample Preparation 

Total RNA is isolated £rom tissue san^les using the guanidinium Oiiocy anate method and 
poly(A)* RNA is purified using the oligo-(dT) c^ulose method. Each poly(A)* RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, O.OS pg//il oligo-(dT) primer Calmer), IX 

5 first strand buffi&r, 0.03 units//il RNase inhibitor, 500 fiM dATP, 500 fiM dGTP, 500 plM dTTP, 40 
liM dCTP, 40 dCTP-Cy3 (BDS) or dCTP-CyS (Amersham Biosciences). Hie leveise 
transcription reaction is performed in a 25 nd volume containing 200 ng polyCA)*^ RNA wifli 
GEMBRIGHT Idts (Incyte). Specific contiol poly(A)^ RNAs are synthesized by in vitro transcription - 
from non-coding yeast genoimc DNA« After incubation at 37°C for 2 tar, eadi reaction sanq>le (one 

10 with Cy3 and anoflier xvidi Cy5 labeliog) is treated witti 2.5 nd of OSb/L sodium hydroxide and 

incubated for 20 nunutes at SS'^C to the stop the reaction and degrade ttie RNA. Samples ace purified 
usiflg two successive CHROMA SPIN 30 gel filtration spin columns CBD Qontecli^ Palo Alto CA) 
and after combining, both reaction samples are etiianol precipitated using 1 ml of glycogm (1 
mgArd), 60 ml sodium acetate, and 300 ml of 100% ethanoL Ibe sanople is then dried to completion 

IS usmg a SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended in 14 fd 5X SSC/02% 
SDS. 

Microarrav Preparation 

Sequences of flie fnesent invention are used to generate array elements. Each array dement 
is amplified from bacterial cells containing vectors with cloned cDN A inserts. PCR anoplification 
* 20 uses primers conq)lementary to the vector sequences flanking the cDNA insert Array elements are 
amplified in tiiirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater tban 5 
f£g. Atxplified array elements are then purified using SEFHACRYL-400 (Amersham Biosciences). 

Purified array elemirats are immobilized on polymer-coated glass slides. Glass microscope 
slides (Coming) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
25 washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 

Scientific Ptoducts Corporation (VWR), West Chester PA), washed extexisively in distilled water, 
and coated with 0.0S% aminopropyl silane (Sigma-Aldrich, St Louis MO) in 95% ethanol. Coated 

slides are cured in a 1 10^ oven. ^ ^ . 

Airay elements are applied to the coated glass substrate using a procedure described in U.S. 
30 Patent No. S»807^22, incoxporated herein by reference. 1 fil of the array element DN A, at an average 
concentration of 100 ng/fd, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
35 Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
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buffered saline (PBS) CTfopix. Inc., Bedford MA) for 30 mimiles at 60»C followed by washes in 
0.2% SDS and distilled water as befoie. 

Hybridization reactions contain 9 /d of sanqde mixtare consistiiig of 0.2 ftg each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization bnffer. The sample 
mixtare is heated to fiS'C for 5 minutes and is aliquoted onto the mictoanay suifooe and covered 
with an 1 .8 cm' coverslip. The arrays are transfraed to a waterproof chamber haviqg a cavity just 
slightly larger flian a microscope sUde. The chamber is kept at 100% humidity intonaUy by tite 
addition of 140 ;tl of 5X SSC in a comer of (he diamber. The chamber containing die anays is 
incubated for about 6.5 hours at eCC. The arrays are washed for 10 ndn at 45"C in a first wash 
buffer (IX SSC, 0.1% SDS), duee times for 10 minutes each at 43''C in a second wash buffer (O.IX 
SSQ. and dried. 
Detection 

Reporter-labeled hybridization con^plexes are detected with a microscope equipped wiii an 
15 Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Qara CA) capable of generating spectnd lines 
at488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc.. Melville NY). The slide 
contaming die array is placed on a conQmter-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned wifli a 
20 resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophoies sequentially, 
knitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Biidgewater NJ) corresponduig to the two fluorophores. 
Ai^ropriate filters positioned between the array and the photomultiplier tubes are used to filter flie 
25 dgnals. The enussion maxima of flie fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. 
Each array is typically scanned twice, one scan per fluorophor« using the appropriate filters at the 
lasw source. aUhough die apparatus is capable of recording flie spectra firombotii fluorophores 
simoltanBou^. 

Ihe sraisitivity of the scans is typically calibrated usmg die signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
die array contams a eonaplementaiy DNA sequence, allowing the intensity of die signal at fliat 
location to be correlated wifli a weight ratio of hybridizing species of 1:100,000. When two samples 
fiom different sources (e.g., lepresaoting test and control cdls), each labeled wifli a difftaent 
fluorophore. are Itybridized to a single anay for flie purpose of idoititying genes fliat are 
drfferentially expressed, flie caUbration is done by labeling samples of the calibratiiig cDNA wifli die 
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two finoropbores and adding identical amounts of each to flie hybridization mbctoie. 

The output of flie photomolliplier tube is di^tized using a 12-bit RTI-83SH analog-to-digital 
(A/D) conveision board (Analog Devices, Inc., Norwood MA) instaDed in an IBM-conapatible PC 
oomputBr. Hie digitized data ate displayed as an image where the signal intensity is mapped using a 

S linear 20-co1ot tzansfomiation to a pseudocolor scale ran^ng from blue (low dgnal) to led Qiigh 
signal). The data is also analyzed quantitatively. Wheietwodififeientfluoiophoiesaceexdtedand 
measoced dmiiltaneously, the data axe first collected for optical cfosstalk (due to ov^apping 
emission spectra) between die finorophoies using each fluoiophore^s emission spectrum. 

A grid is superimposed over flie fluorescence signal image such that fixe signal finom each 

10 spot is centered in each element of the grid. The fluorescence signal wifliin each element is dien 
integrated to obtain a numericsd value conesponding to the average inteni^ty of the signal. Ibe 
software used for dgnal analyris is the GEMTOOLS gene rapression analysis program (Incyte). 
Array elements that exhibit at least about a two-fold change in expression, a signal-to-bacl^round 
ratio of at least about 2.5, and an docirait spot size of at least about 40%, are considered to be 

IS differentially expressed. 

XII. Complementary Pdlymideotides 

Sequences complementary to the KPP-encoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of naturally occurring KPP. Although use of oligonucleotides 
comprising &om about 15 to 30 base pairs is described, essentially the same procedure is used with 

20 smaller or with larger sequence fragments. Appropriate oligonucleotide are designed using OUGO 
4.06 software (National Biosciences) and die coding sequence of KPP. To inhibit transcription, a 
conqplementaiy oligonucleotide is designed from^ nx>st unique 5' sequence and used to prevent 
pFOTDoter binding to the coding sequence. To inhibit translation, a complenientary oligonucleotide is 
designed to prevent ribosomal binding to die KPP-encoding transcript 

25 Xm. Egression of KPP 

Expression and purification of KPP is achieved using bacterial or virus-based expression 
systenos. For expression of KPP in bactma, cDNA is subdoned into an appropriate vector containing 
an antibiotic resistance gene and an inducible pronnter that directs high levels of cDN A transcription, 
l&anqples of sucb promoters indude, but arc not limited to, the trpAac (fad) hybrid pnmiDter and t£e~ 

30 TS or T7 bacteriophage promoter in conjunction with the lac opmtor regulatory d^nraat 

Recombinant vectors are transformed into suitable bact»ial hosts, e.g., BL21(DE3). Antibiotic 
resistant bacteria express KPP upon induction with isopropyl beta-D-thio^actopyranoside (DPTG). 
Expression of KPP in eokaiyotic cells is achieved by iofiecting insect or mammalian cell Unes with 
mscmbiDaaaxAutographica califomica nudear polyhedrosis virus (AcMNPV), commonly Imown as 

35 baculovirus. The nonessentisdpolyhedringme of baculovirus is replaced widicDN A encod^ 
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by ejflier homologous recombination or bactmal-mediated transposilion involving transfo plasmid 

interoaediates. Viral inffegtSvily is matntainMi atwl thft gtrripg pftlyhwiTii^ pvewrwit^r firivftg ItSgh IftvelQ 

of cDNA transcription. Recombinant bacidoviius is used to infect Spodoptera firugiperda (Sf9) insect 
cells in most cases, or human hepatocytes, in some cases. Iniectionof the latt^reqiiiies additional 

5 g^iedc modifications to bacolovinis (Engelhaid, B.K. et al. (1994) Proa NatL Acad. Sci. USA 
91:3224-3227; Sandig, V. et al. (1990 Hum. GemTher. 7:1937-1945). 

In most expression systems* KPP is synthesized as a fusion protdn with, ewg., ghitatbione S- 
transferase (GST) or a peptide epitope tag» such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein firom CTde cdl lysates. GST, a 26- 

10 Idlodalton euzyme finom Schistosoma japonicum, enables the purification of fusion protdns on 

immobilized gihitatbione under conditions that maintain protein activity and antigenicity (Ami^sbam 
Bioscimces). Followii^ purification, the GST moiety can be proteolyticalty cleaved fiom KPP at 
specifically enginemd sites. FLAG, an 8-amino add peptide, enables immunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 

15 His, a stretch of six consecutive histidine residues, enables purification on inetal-chelate resins 

(QIAGEN)> Methods for protdn expression and purification are discussed in Ausubel et al. (jsupra^ 
ch. 10 and 16). Purified KPP obtained by these methods can be nsed directly in the assays shown in 
Exan5)les XVII, XVm, XIX, XX, and XXI, vihsx^ applicable. 
Xrv* Functional Assays 

20 KPP function is assessed by expressing the sequences encoding KPP at physiolo^caUy 

elevated levds in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
include PCMV SPORT plasmid (Invitrogen, Carlsbad CA) and PCR3,1 plasmid (Invitrogen), both of 
which contain the cytomegalovirus promoter. 5-10 //g of recombinant vector are transiently 

25 transfected into a human cell line, for exaix^le, an oidothelial or hematopoietic cdl line, using either 
liposome fomadations or dectroporation. 1-2 /^gofan additional plasmid containing sequences 
encoding a maxlcer protein are co-transfected. Expression of a marker protean provides a means to 
distinguish transfected cells firom nontransfected ceDs and is a rdiable predictor of cDN A expression 
fiom the recombinant vector. Marker piotdns of choice include, e.g., (Sreen Fluorescent ProtSnT ^ 

30 (GFP; BD Clontech), CD64, or a CD64-GFP fusion protdn. Flow cytometry (FCM), an automated, 
laser optics-based teduuqne, is used to identify transfected cells expressing GFP or CD64-GFP and to 
evaluate the apoptotic state of the cells and o1h» ceSular properties. FCM detects and quantifies the 
uptake of fluorescent molecules tiiat diagnose events preceding or coincident with cell death. These 
events include changes in nuclear DN A contmt as measured by staining of DNA with propidium 

35 iodide; changes in cell size and granularity as measured by forward ligiht scatter and 90 degree side 
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ligjit scatter, do wn-regulation of DNA synthesis as measnred by decrease in bnanodeoxyoridiiiB 
uptake; alterations in expression of ceU surface and intracdhUar proteins as measuied by reactivity 
with specific antftodies; and alterations in plasma membrane composition as measuied by the binding 
offluoiesceinK^njugatedAmiexinV protein to the cen surface. Mefliods in flow cytometry are 
discussed in Ormerod, M.G. (1994; Flow Cytometry Oxford, New York NY). 

Tlie inflnoice of KPP on gene expression can be assessed using highly purified populations 
of c€ais transfected with seqaeaces encoding KPP and either CD64 or CD64-GFP. CD64 and CD64- 
GFP are expressed on the surface of transfected cells and bind to conserved regions of human 
immnnofiJobulinOagG). TWected cells are efficiently separated fi^nontransfectedcdls using 
maguetic beads coated with dflier human IgG or antibody against CD64 (DYNAL. I^e Sua^^ 
NY). mRNAcanbepurifiedfiomfliecells using methods well known by Aose of skill in the art. 
Expression of mRNAencodiflg KPP and other genes of interest canbe analyzed by norfliem analysis 
or nacroatray techniques. 

XV. Fraducaon of KPP Spedflc Antibodies 

KPP substantially purified using polyacrjaamide gd dectrophoitssis (PAGE; see. e.g.. 
Harrington. M.G. (1990) Methods Enzymol. 182:488-195). or other purificationteduiiques.il used to 
immunize animate (e.g.. rabbits, mice, etc.) and to produce antibodies using standard protocols. 

Altemativdy. the KPP amino add sequence is anatyzed usmg LASERGENE software 
(DNASTAR) to determinB regions of highimmnnogeaiidty, and a conesponding oligopeptide is 
syn«hesizedandusedtoiaiseantiTx,diesbynwinsknowntothoseofskin Methods ftr 

selection of appmpriate epitopes, sudi as those near the C-terminus or in hydrophiKc regions an* wdl 
desoibed in the art (Ansnbd et al., supra, cb. 1 1). 

Typically, oligopeptides of about 13 residues in lengflj are synfliesized using an ABI 431A 
peptide synthesizer (AppKed Bioq«tems) using mOC diemistry an^ coupled to KLH (Sigma- 
25 Aldrich. St MO) by reaction wilbN.maldmid«RMsnz«qd.N.^ 

increase immunogenicity(Ausubdetal..5Hpn,). Rabbite are immunized with flie oligopeptide-KLH 
cornplex in complete Freund's adjuvant Resultiflg antiseia are tested fbr aitipeptide and anti-KPF 
activity by. for example, binding the peptide or KPP to a substrate, bloddng with 1% BSA. reactiflg 
with rabbit antisera, washing, and reacting wifli radio-iodinated goat anti-rabbit IgO. - - - — ~ - 
30 XVI. Purification of Naturally Occurring KPP Using Spedflc Antibodies 

Naturally occurring or recombinant KPP is substantiany purified by immunoaffinity 
chromatography using antibodies specific for KPP. An immunoaffinity cohimn is constructed by 
covalently coupling anti-KPP anUT,ody to an activated diromatographic resin, sudi as CNBr-activated 

SEPHAROSE(AmershamBiosciences). AfterthecoupliBg.theiesinisblodcedandwad»d • 
35 according to the manufacture's instructions. 
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Media containfaig KPP are passed over the immimoafBiuty column, and the cohmm is washed 
uiuta- conditions that aDow ^ prefe^ntial absorbance of KPP (e.g.. high iomc strenglh buffers in the 
pnsence of detergent). Tl»cohMna is eluted under conditions that disrupt antibody/KPP binding 

(e.g. . a buffo of pH 2 to pH 3. or a high concentration of a chaotrope. su<A as mea or tWoc^ 
ion), and KPP is coIlectBd. 

XVn. Identification of Molecules mich Interact with KPP 

KPP, or biol«^cally active fragments thereof, are labeled with >«I Bolton-Hunter reagent 
(Bolton. AE. and W.M. Hunter (1973) Biochem J. 133-529.539). Candidate n»lecules previously 
armyed in fl» weDs of a nrnto-wdi plate are incubated with flie labded KPP. washed, and any wells 
MdAhibdedKPPcon^flexaieassayBd. Data obtained using different concentrations of KPP are used 
to calcuhite vahies for flie nnnibw, affinity, and assodation of KPP wifli the candidate n»lecules. 

Alteniatively. molecules interacting widi KPP are analyzed using die yeast two-hybrid 
system as described in Fields. S. and O. Song (1989; Nature 340:245-246), or using commerciaUy 
available kits based on the two-hybrid system, sue* as flie MATCHMAKER system (B D Qontech). 

KPP may also be used in ae PATHCALLING process (CuraGen Corp., New Haven CT> 
whidi employs the yeast two-hybrid system in a high-thwughput manner to detenrine all interactions 

betweenfl»pi»tdnsencodedbytwolaigeUbrariesofgenesCNandabalan,K.etaL(2000)U.S. 
Patau No. 6,057,101). 

XVm. Demonstration KPP Activity 

Generally, protem kinase activity is measured by quantifying phosphorylation of a protein 
^bstratebyKPPinfliepresenceof [y.«PlATP. KPP is incubated wifli the protdn substrate. 
«P-ATP.andanapp«,priatekinasebu£fer. The «P incorporated into the substrate is separated from 
free^.ATPbydectro0«««sisandfl»incorporated«Pisc^ ^ 
amountofhicorporated^PisproportionaltotheactivityofkPP. A detetmfaiation of fte specific 
aminoacidresiduephosphorylatedismadelqrphos0u»mnK,addanatysisofttehy^^ - 

hi one alternative, protein kinase activity is measured by quantifying the transfer of gamma 
phosphate fiom ad«iosine triphosphate (ATP) to a serine, threonine or ty„«dne residue in a protein 
substrate. The reaction occurs between a protein kinase sample with a biotinjdated peptide substrate 

and gamma '^P.ATP. Following die reaction, free avidin in solution is added for binding to fl» 

biotinylated^P-peptide product TTie binding sample then undergoes a centrifugal ultrafiltration 
process with a membrane which will retain the product-avidin complex and allow passage of free 
gamma ^^-ATP. The reservoir of the centriluged unit containing die '^-peptide product as retentate 
is aien counted in a scintillation counter. This procedure aDows die assay of any type of protdn 
kinase sample, depending on the peptide substrate and kinase reaction buffer selected. This assay is 
provided in kit form(ASUA. Affinity Ultrafiltration Separation Assay. Tiansbio Corporation. 
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BaltinaDie MD, U.S. Pat^ No. 5,869*275)* Sng^sted substrates aod flidr tespective eaasytnes 
include but are not Unnted to: HistoneHl (S]gim)aiidp34*^'''%in^ 

and EGF receptor Idnase, Aimexm II and src kinase, ERKl & ERK2 substrates and MEK, and n^elin 
basic protean and ERK (Pearsoh, J.D. et al. (1991) Methods Enzymol. 200:62-81). 

5 In anoOi!^ alternative, protean Idnase activity of KPP is demonstrated in an assay containing 

KPP, 50 Ml of kinase buffi^, 1 |ig substrate, such as nogrelin basic protdn O^ABP) or syndietic peptide 
substrates. 1 mM DTT. 10 (ig ATP, and 0.5 yCi [y-^PlATP. The reaction is incubated at 30^*0 for 30 
nnnutes and stopped by pipetting onto P81 paper. The unincorporated [y-^] ATP is rraioved by 
wasU]^ and the iiicorporatedradioactivitir is measured usin^ Alternatively, 

10 the reaction is stopped by heating to lOO^'C in the presence of SDS loaifing bu£G^ and resolved on a 
12% SDS polyacrylamide gel followed by autora^ography. He amount of incorporated ^^P is 
proportioaal to the activity of KPP. ^ 

In yet anofiier alt^native, adenylate kinase or guanylate kinase activity of KPP may be 
measured by the incorporation of ^*P firom [y-^^P] ATP into ADP or GDP using a gamma radioisotope 

IS counter. KPP, in a kinase buffer, is incubated together with the appropriate nucleotide 

mono-phosphate substrate (AMP dr GMP) and ^-labeled ATP as the phosphate donor. Ihe reaction 
is incubated at 37^C and terminated by addition of trichloroacetic acid. The acid extract is 
neutralized and subjected to gel electrophoresis to separate the mono-, di-, and triphosphomicleotide 
fractions. Hie diphosphonucleotidefiraction is excised and counted. The radioactivity recovered is 

20 proportional to the activity of KPP. 

In yet another alternative, other assays for KPP include scintillation proximity assays (SPA), 
scintillation plate technology and filter binding assays. Useful substrates include rec om binant 
pfotdns tagged with glutathione transferase, or synthetic p^tide substrates tagged with biotin. 
Inhibitors of KPP activity^ such as small organic molecules, proteins or peptides, may be identified by 

25 such assays. 

In another alternative, phosphatase activity of KPP is measured by the hydrolysis of para- 
nitropher^l phosphate (PNPP). KPP is incubated together with PNPP in HEPES buffer pH 7-5, in the 
presence of 0.1% p-meicaptoethanol at 37 **C for 60 nmn. The reaction is stopped by Ae addition of 6 
ml of 10 N NaOH (Diamond, R.H, et al. (1994) MoL CdL BioL 14:3752-62). Alteriiatively; acid 

30 phosphatase activity of KPP is dCTOonstrated by incubating KPP-containixig extract with 100 ^1 of 10 
niM PNPP in 0. 1 M sodium citrate, pH 4. S , and 50 jLd of 40 mM NaCl at 37 '^C for 20 nmn. The 
reaction is stopped by Oe addition of 0.5 nil of 0.4 M g;iycine/NaOH» pH 10.4 (Saftig, P. et al. (1997) 
J. BioL Oiem. 272:18628-18635)* The increase mli^t absorbance at 410 nmresiiltiiig from Che 
hydrolysis of PNPP is measured using a spectrophotometer. Hie increase in light absorbance is 

35 proportional to the activity of KPP in the assay. 
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Indiealteniative^ KPP activily isdetmrniedbyineasiiri^ 
firomaphosphorylatedpioteinsubstTatiew Reacdoiis aiep^fimxied vd1h2or 4DMIQ'Pmafiiial 
volam of 30 ^ containing 60 niM Tds , pH 7 .6 , 1 nM EE3TA, 1 rsM EGTA, 0.1% 
p-xDBTcaptoethanol and 10 fiM sobscrate* ^P-labded on smne/threonine or tyxosine, as appiopriatew 
5 Reactions are initiated with substrate and incubated at 30^ C for 10-15 noin. Reactions are gn^ched 
wi& 4S0 111 of 4% (w/v) activated chaicoal in 0.6 M HQ, 90 nM Na4P207, and 2 nM NaH2P04, dm 
craitrifiiged at 12,000 x g for 5 nan. Acid-soluble ^ is quantLBed by liquid sdntDlation counting 
(Sinclair, C. et al. (1999) J. BioL ChenL 274:23666-23672). 

XIX. Kinase Binding Assay 

10 Binding of KPP to a FLAG-CD44 cyt fusion protein can be detenmned by incubating KPP 

widi anti-KPP*conjugated immunoafOnity beads followed by incubating portions of the beads (having 
10-20 ng of protdn) witii 0.5 inl of a binding buffer (20 nM Tris-HCL (pH 7.4), 150 nM NaCl, 0.1% 
bovine senun albumin, and 0.05% Triton X-100) in the presence of ^^I-labeled FLAG-CD44cyt 
fusion protein (5,000 cpm^ protein ) at 4 **C for 5 hours. Following binding, beads were washed 

IS thoroughly in the binding buff<^ and ttie bead-bound radioactivity measured in a scintillation counter 
(Bourguignon, L.Y.W. et al. (2001) J. Biol. Chenx 276:7327-7336). The anoount of incorporated 
is proportional to die amount of bound KPP* 

XX. Identification of KPP Inhibitors 

Conqiounds to be tested are arrayed in the weQs of a 384-weIl plate in varying concentrations 
20 along with an appropriate buffer and substrate, as described in the assays in ExanQ>le XVII. KPP 
activi^ is measured for each well and the ability of each conq>ound to inhibit KPP activity can be 
detmnined, as well as the dose-rei^me kinetics. This assay could also be used to identify naolecoles 
which «hance KPP activiQr. 

XXI. Idmtificatlon of KPP Substrates 

25 A KPP ''substrate-trapping*' assay takes advantage of the increased substrate a£Bni1y that may 
be conferred by certain mutations in the PTP signature sequence of protdn tyrosine phosphatases. 
KPP bearing tiiese mutations form a stable compla with tbdr substrate; this conqd« may be isolated 
biodienncaDy. Site-directed mutagenesis of invariant residues in the FTP signature sequence in a 
clone encoding the catalytic domain of KPP is perfomoed using a.mediod standard in die art or a ' ' 

30 comnimial kit, such as the MUTA-GENBldt from BIO-R^ For expression of KPP mutants in 
Escherichia colU DNA fragments containing the mutation are exchanged with the correspondii^ 
wild-type sequence in an expression vector bearing the sequence encoding KPP or a glutathione 
S-transferase (GST)-KPP fusion protein. KPP mutants are expressed in R coli and purified by 
duomatograidxy. 

35 The expression vector is transacted into COSl or 293 cdOs via calcium phosphate-mediated 
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traiisfecdonvndi20MgofC5a-parifiedDNAp^lOKmd^ Forty- 
right hoois after transfecticm, cells axe stimulated witid 100 ng/ml qpidennal growthfiactor to increase 
tyioaiiephosplioiylatioamcells, astbe^^xom^ Cdlsare 
lysed in 50 nM Tds-HOi. pH 7*5/5 nM EDTA/150 mM NaCl/1% Triton X-100/5 noM iodoacetic 

5 add/10 inM sodtomphosphate/lO mM NaF/5 ftgAsfl l&xp&pdn/S (ig/nil aprotimn/1 nMbenzanndine 
(1 ml per 10-cm^sh, 0.5 ml p^ 6-cmdish). KPP is immimoprecdpitated firomlysates widi an 
appropriate antibody. GST-KPP fusion protems are predpitated with ghitathione-Sqiharose, 4 |ig of 
mAb or 10 |il of beads respective^ per of cell lysate. Complexes can be visualized by PAGE or 
fur&er purified to identify substrate molecules (Flint, AJ. et sd. (1997) Proc. Natl. Acad. Sd. USA 

10 94:1680-1685). 

Various modifications and variations of the described compositions, methods, and systems of 
flie inVCTtion win be apparent to those slcilled in the art without departing from the scope and spirit of 
the invention. It will be appreciated that tbe invention provides novel and useful proteins, and their 

15 encoding polynucleotides, which can be used in fte drug discovery process, as weU as methods for 
using these conqiositions for the detection, diagnosis, and treatment of diseases and conditions. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments^. 
Nor should the description of ^ch enibodiments be considered exhaustive or linit the invention to 

20 the precise forms disclosed. Furthermore, elements from one embodim^ can be readily recombihed 
with elements from one or more other enibodiments. Such conibinations can form a nuinber of 
embodiments within the scope of the invration. It is intended that the scope of the invention be 
defined by the following claims and their equivalmts. 
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L An isolated polypqitidesdected from flie group ccxnsi^^ 

a) fl pnlypfipHdft nfimprjging ati airwnn acid geqnenge selected ftmn the gmiip congisring 

ofSBQIDNO:!, 

b) a polypeptide conq^iisiiig a natnraBy occurring andno add sequmce at least 90% 
identical to an amino add sequence selected from flie group consisting of SEQ ID 
NO:l. 

c) a biolo^cally active fr agmait of a polypq)tide having an ammo acid sequmce 
sdected fromthe group consisting of SEQ ID NO:l« and 

d) an immunogenic fragment of a polypeptide having an annno acid sequence selected 
from the group consisting of SEQ ID NO;l. 

2. An isolated polypqrtide of claim 1 comprising an amino add sequence selected from the , 
group consisting of SEQ ID NO:l. 

3. An isolated polynucleotide encoding a polypeptide of claim 1 . 

4. Aq isolated poljraucleotide encoding a polypeptide of claim 2. 

5. An isolated polyaucleotide of claim 4 con^rising a polynucleotide sequence selected from 
the group consisting of SEQ ID NO:2. 

6. A recombinant polynucleotide conqyiising a promoter sequence operably liniced to a 
polynudeotide of daim 3. 

7. Aceiatransformed with a recombinant polynucleotide of daim 6. 

8. A transg^corgamsmcon^iisipg^azeconjbinant polynudeotide - " ' 

9. A noethod of produdng a polypeptide of daim 1 , the method con^iising: 

a) cultuiing a cell under conditions suitable for expression of the polypqitide^ i^rdn 
said cell is transformed widi a reconibinant polynucleotide, and said recontbinant 
polynucleotide comprises a promoter sequence operabty linked to a polynudeotide 
ocodiDg the polyp^tide of claim 1, and 
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b) lecoveaiiig a» polypeptide so eiqiressed. 



10. AinBflM>dofclaim9,whereinthepolypeptidecoin^^ 
selected fitamHie group consistiiig of SEQ ID NO:l. 

11. An isolated antiboifyw*i(ftspecificany binds to a polypqjtide of daim 

12. An isolated polynucleotide selected from the group consisting of: 

a) a Poly^cleotideconyrisiflg a polynucleotide sequrace selected fiomtiie group 
consisting of SEQ ID N02, 

b) a PoJy*»cJ«»tidecoiiyrising a naturally occurring polynucleotide sequent 
98% identical to a polynucleotide sequence selected ftomthe group consisting of 
SEQIDNOa. 

" polynucleotide complementaiy to a polynucleotide of a), 

d) a polynucleotide conipleinraitary to a polynucleotide of b), and 

e) anRNAequivaloitof a)-d). 

13. Anisolatedpolynudeotideconqirising at least 60 contiguous nudeolid^ 
polynucleotide of claim 12. 

14. A metliod of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, die method con^nisii^: 

a) Iq^ridizing fl» sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in flie sanqde. 
and which probe specifically hybridizes to said target polynucleotide, under 
conditions whereby a hybridization complex is formed between said probe and said 
target polynucleotide or firagoents dieieof. and 

b) detecting the presence or absence of said hybridization cobles, and. optionally, if 
pres^it. tibe anxrant thereof.. . - - - r 

15. A method of claim 14, wfaen»n (he probe conynses at least 60 contiguous nucleotides. 

16. A method of detecting a target polynucleotide in a sample, said target pdynndeolide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) amplifying said target polynucleotide or fragment thcaeof usi^g pcdymerase chain 
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reacdon an^lificatiQii, and 
V) detecting tbo presence or abs»ce of said anplified target polynndeolide or fragvneiit 
ibimaf, and, optional^, if present, the amoimt flimof. 

17. A conq^osition conq>rising a polypeptide of didm 1 and a pbannaceaticaSy acceptable 
exdpient 

18. Acon^ositionofclainil?, whemntiiepolypqytidecoi^^ 
selected from flie group consisting of SEQ ID NO:l. 

19. A method for treating a disc^ase or condition associated with decieasedexpr^^ 
functional KPP, conqnisipg administaring to a patient in need of such treatment the coinposition of 
claim 17. 

20. Amethod of screening a confound for effectiveness as an agonist of a polypqitide of 
claim 1, the method conqixising: 

a) contacting a sanq)le conqnising a polypq>tide of claim 1 with a conqpoimd, and 

b) detecting agonist activity in die sanqile. 

21. A conq)osition coxx^rising an agorist compound identified by a method of claim 20 and a 
phanmceutically acceptable excipi^t. 

22. A me&od for treating a disease or condition associated with decreased expression of 
functional KPP« conq)rising adnnnisteiing to a patient in need of sudi treatment a composition of 
claim 21. 

23. A method of scre^iing a confound for effectiv^oess as an antagonist of a polypeptide of 
claim 1, the no^faod coixq>rising: 

a) contacting a sanqfle comprising a polypeptide of claun 1 witix a con^iihd, and ^ 

b) detecting antagonist activity in the sanqile. 

24. A coinposition compiisixg an antagonist con^und identified by a method of claim 23 
and a pharmacentically acceptable excipient. 

25. A method for treating a disease or condition associated with oviaiexpression of fonctimal 
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KFP, conq>iisiDg adnmiistmi^ to a patient in need of sndi treatment a conqposition of claim 24. 

26, Amefliodof scteeaiiiig for a conq>GuxM3tlmt specifically b 
1, the metbod conqirisii^. 

S a) combining the polypeptide of daiml wifliatleast cme test cony oundiiiidy suitable 

conditions* and 

b) detectingbindingof the polypeptide of claim 1 to the test coi^^ 

identifying a compound tjiat specifically binds to the polypqitide of claim 1 . 



10 27. Amethodof scieening for a conqmind that modulates the activity of tte 

claim 1, the method conqxrising: 

a) combining the polypeptide of claim 1 wifli at least one test compound under 
conditions pemdssive for the activity of the polyp^tide of claim 1, 

b) assessingtheactivityof the polypeptide of claim 1 in the pres^ice of the test 
15 conqpound, and 

c) concQiaringtheactivity of the polypeptide of claim 1 in the presence of the test 
confound with the activity of the polypeptide of claim 1 in the absence of the test 
confound, wberdn a change in the activity of the polypeptide of claim 1 in the 
presence of the test coxjapouvd is indicative of a confound tbat modulates the activity 

20 of the polypeptide of claim 1 . 

28. A method of screening a con^und for effectiveness in altering expression of a target 
polynucleotide, wh^rdn said target polynucleotide comprises a sequence of claim 5, the method 
comprisiDg: ^ 

25 a) contacting a sanqde coiiyrising the target polynucleotide with a con^)oux^ 

conditions suitable for fiie expression of the target polynucleotide^ 

b) detecting altered expression of ttie target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of vaiying 
amounts of the cony ound andjn the absence of tiie coicpound. - - ^ ' 

30 ' 

29. A method of screemog for pot^itial tojddty of a test compound, die method comprising: 

a) treating a biological sample containmg nucleic acids with the test conqiound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising 
at least 20 contiguous nucleotides of a polynucleotide of claim 12 under conditions 

35 whoceby a specific hybridization complex is fomoed between said probe and a taiget 
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polyiiacleotide in the biolo^cal sample, said target polynucleotide conq>nsing a 
po^nanicleotide seqaence of a polynucleotide of claim 12 or firagm^ thereof, 

c) quantiiyiixg the amount of hybridization conq)lex, and 

d) coiEq)ariiig the amount of hybridization con:^le3& in the treated biological sairpl& with 
5 die amount of hybridization con^lex in an untreated biolo^cal sanq)le, wherein a 

difG^^ence in the amount of hybridization complex in die treated biological sanqile 
indicates potential toxicity of die test confound. 



30. A mediod for a diagnostic test for a condition or disease associated with the expression 
10 of KPP XQ a biological sanqile, the mediod conqirising: 

a) combining the biological sample with an antibody of claim 1 1 , under conditions 
suitable for the antibody to biad the polypeptide and form an antibody:polypeptide 
conq>lex» and 

b) detecting the conqdex, viimm the presence of die con^ilex correlates with tibe 
15 presence of the polypeptide in die bido^cal sanqple. 

31. Theandbodyof claim 11, wherein tiie antibody is: 

a) a chimmc antibody, 

b) a single chain antibody, 
20 c) a Fab fragment, 

d) a FCab*)2 fragm^ or 

e) a humanized antibody. 



25 



32. Aconqiositioncon^risiiiganantniodyof daimll and anaccqptableexci^^ 

33. A nistiiod of diagnosing a conffition or ^easeassodatedwiOi the expression of 

a subject, conqyrising adnanistmi^ to said subject an effective amount of the conqKisition of claim 
32. 

30" 34. A composition of claim 32, iurdtercoaqirising a labd. 

35. A method of diagnosing a condition or disease associated with the expression of KPP in 
a subject, conqirising administering to said subject an effective amoimt of the composition of claim 
34. 



35 



36. A mediod of preparing a polyclonal antibody with the specificity of the antibody of claim 
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1 1 , diie melliod conq^risiiig: 

a) ttttmuniTrng an ammal with a polypeptide consisting of an amino acid sequence 

selected from the group consisting of SEQ ID NO:l, or an immunogenic firaga»it 
thereof » under conditioxis to elidt an antibody response, 
5 b) isolatii^ antibodies from the animal, and 

c) scre^iing the isolated antibodies with die polypeptide, thereby identifying a 
polyclonal antibo^ whidi specifically binds to a polypeptide comprising an annno 
acid sequence sdected firom the group consisting of SEQ ID NO:l. 

10 37. Apolyclonalantibody produced by a method of claim 36. 

38. A con[g)osition comprising the polyclonal antibody of claim 37 and a suitable carrier. 

39. An^thodofmaldngamonoclDnalantibody m& the specificity of the a^^ 
15 11, llie naeOiod conq>rising: 

a) immunizing an animal with a polypeptide consisting of an amino add seqgaence 
sdected firom the group consisting of SEQ ID NO:l, or an immuno^nic firagment* 
thereof, nnd^ conditions to ^dt an antibody response, 

b) isolatiiig antibody producixig cells firom the animal, 

20 c) fusing Ae antibody produdng ceUs with immortalized ceS^ 

antibody-produdi^ hybridoma cells, 

d) culturing the hybridoma cells, and 

e) isolating firomtte culture inonoclonal antibody wUdispecifical^ 

polypeptide comprising an amino add sequence selected from the group consisting of 
25 SEQIDNO:!. 

40. Amonoclonalantibodyprodttcedby a method of claim 39r 

41 . A composition comprising the monockmal antibocfy of claim 40 and a suitable carri^r.^ , 

42. The antibo^ of claim 11, wherdn the antibody is produced by screeiung a Fab 
expression library. ' 

43. The antibody of claim 1 1, wherein the antiboi^ is produced by screoiing a recombinant 
35 immunoglobulin library. 
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44. A meOiod of detecting a polypeptide conqprisiDg an anuno add sequence selected firom 
die group consisting of SEQ ID NO:l in a sample, tbe method conoprising: 

a) incubatipg the antibody of datm 11 with ttie sanqdennd^ conditions to allow 
specific binding of the antibody and the polypeptide, and 
5 b) detecting specific binding, i^^^ierein spe<dfic binding indicates the presence of a 

polypeptide con^rising an amino acid sequence selected firomtbe group consisting of 
SEQ ID NO:l in flie san^le. 

45. A method of purifying a polypeptide comprising an annno acid sequence selected from 
10 the group consisting of SEQ ID NO:l from a san^>le, the method conq^rising: 

a) incubatibag the antibody of claim 1 1 with the sanq)le under conditions to allow 
specific binding of the antibody and the polypeptide, and 

b) separating the antibody from tiie sanqple and obtaining the purified polyp^tide 
donq)risi^g an anmno add sequeaice selected from the group consisting of SEQ ID 

15 NO:l. 



46* Anm(mairay wh^^atleastonedlementof fhemicrosurrayisapolym 



13. 



20 47. Amethodof generating an expression profile of a sample ^^ch contains 

polynucleotides, the method conoprising: 

I 

a) labeling the polynucleotides of the sanqde, 

b) contacting the demmts of the microairay of claim 46 with the labeled 
polynucleotides of the sanqde under conditions suitable for the fiirmation of a 

25 hybridization con^lex, and 

c) quantii^dne the expression of the polynucleotides in the sample. 



48. An array coniprisiqg HSmsA nucleotide molecules affixed in distinct physical locations 
on a solid substrate, wheroin at least one of said nudeotide molecules comprises a first 

30 oligonucleotide or polynucleotide sequence specifically hybridizable with at least 30 contiguous 
nucleotides of a target polynucleotide, and whrnin said target polynucleotide is a polynucleotide of 
claim 12. 

49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
35 completely con^lementary to at least 30 contiguous nucleotides of said target polynucleotide. 
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50. Ananray of claim 48» wlimin said fixstoHgotmdeotide or 

complete conqxImKDtaiy to at least €0 contiguous nucleotides of said target polynucleotide. 

51. Anaitay oFclaim48» Mdicieansaidfiistoligoiiudeotideor^ 
5 conapletety conqplCTaentaiy to said targ^ poifynucleodde. 

52. AnafrayofcIaim48»i^diisanucioaiTay. 

53. An array of claim 48» iurther comprising said target polynucleotide bybridized to a 
10 nucleotide molecule coniprising said.first oligonucleotide or polynucleotide sequence. 

54. An array of claim 48, wlier^ a linker joins at least one of said nucleotide molecules to 
said solid substrate. 

15 55. An array of claim 48, wh^ein each distinct physical location on ^e substrate contains 

multiple nucleotide molecules, and &e multiple nucleotide molecules at any single distinct physical 
location have the same sequence, and each distinct physical location on the substrate contains 
nucleotide molecules having a sequence which dii^G^ fiom the sequrace of nucleotide molecules at 
anottier distinct physical location on the substrate. 

20 

56. A polypeptide of claim 1 , conqprising the axmno acid sequence of SEQ ID NO: 1 . \ 

57. A potynudeotide of daim 12, conq)rising the polynucleotide sequence of SEQ ID NO:2. 

25 
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ABSTRACT OF THE DISCLOSURE 

Various CTbodimsnts of flie invralicin provide famnan kinases and phosphatases (KPP) and 
polynndeotides^««iicA identify and CTcodero Embodimnts of the invMlion also provide 
cacpression vectors, host ceJls, antibodies, agonists, and a^ Other embodiments provide 

mefliods fiwr diagnosing, treating, or preventii^g disordm associated wifli abenant expression of KPP. 
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<110> FORSYTHE, Ian J.; Z.EE, Sdo Y.; 
CKAWLA., Narinder R. 

<120> KINASES AND PHOSPHATASES 

<130> PP-1722 P 

<140> To Be Assigned 
<14I> Herewith 

<160> 2 

<170> PERL Program 
<210> 1 

<211> 495 J 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7526180CD1 



<400> 1 






















Met 


Cys 


Gin 


Ala 


Pro 


Cys 


Trp 


Arg Ala 


Gly Gly 


Ser 


Gly Leu 


Gly 


1 








5 








10 






15 


Arg 


Cya 


Ser 


Leu 


Cys 
20 


Arg 


Ser 


Cys Ser 


Leu Ala 
25 ■ 


Arg 


Phe Pro 


Arg 
30 


Leu 


Pro 


Ser 


Phe 


Pro 


Pro 


Pro 


Gly Arg 


Leu Arg 


Ala Gly Val 


Cys 


Ala 








35 








40 






45 


Arg 


Glu 


Gly 


Glu 


Gly 


Val 


Gly Gly 


Val Gly 


Gly Gly Val 


Pro 


Val 
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55 






60 


Pro 


Lys 


Arg 


Pro 


Ala 


Glu 


Gly Gly 


Gly Gly 


Cys, Glu Gly 


Leu 


Arg 


Glu 






65 








70 






75 


Ala 


Met 


Asp 


Val 


Glu 


Arg Leu 


Gin Glu 


Ala 


Leu Lys 


Asp 


Phe 








80 








85 




90 


Glu 


Lys 


Arg 


Gly 


Lys 


Lys 


Glu Val 


Cys Pro 


Val 


Leu Asp 


Gin 


Phe 
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100 






105 


Leu 


Cys 


His 


Val 


Ala 


Lys 


Thr Gly 


Glu Thr 


Met 


He Gin 


Trp 




Gin 






110 








115 






120 


Ser 


Phe 


Lys 


Gly 


Tyr 


Phe 


lie Phe 


Lys Leu 


Glu 


Lys Val 


Met 










125 








130 




135 


Asp 


Asp 


Phe 


Arg 


Thr 


Ser 


Ala 


Pro Glu 


Pro Arg 


Gly 


Pro Pro 


Asn 










140 








145 




150 


Pro 


Asn 


Val 


Glu 


Tyr 


He 


Pro 


Phe Asp 


Glu Met 


Lys 


Glu Arg 


He 










155 








160 




165 


Leu 


Lys 


lie 


Val 


Thr 
170 


Gly 


Phe 


Asn Gly 


He Pro 
175 


Phe 


Thr He 


Gin 
180 


Arg 


Leu 


Cys 


Glu 


Leu 


Leu 


Thr 


Asp Pro 


Arg Arg 


Asn 


Tyr Thr 


Gly 


Thr 








185 








190 






195 


Asp 


Lys 


Phe 


Leu 


Arg 


Gly 


Val Glu 


Lys Asn 


Val 


Met Val 


Val 


Ser 








200 








205 






210 


Cys 


Val 


Tyr 


Pro 


Ser 


Ser 


Glu Lys 


Asn Asn 


Ser 


Asn Ser 


Leu 


Asn 








215 








220 






225 


Arg 


Met 


Asn 


Gly 


Val 


Met 


Phe Pro 


Gly Asn 


Ser 


Pro Ser 


Tyr 
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230 235 240 

Thr Glu Arg Ser Asn He Asn Gly Pro Gly Thr Pro Arg Pro Leu 

245 250 255 

Asn Arg Pro Lys Val Ser Leu Ser Ala Pro Met Thr Thr Asn Gly 

260 265 270 

Leu Pro Glu Ser Thr Asp Ser Lys Glu Ala Asn Leu Gin Gin Asn 

275 280 285 

Glu Glu Lys Asn His Ser Asp Ser Ser Thr Ser Glu Ser Glu Val 

290 295 300 

Ser Ser Val Ser Pro Leu Lys Asn Lys His Pro Asp Glu Asp Ala 

305 310 315 

Val Glu Ala Glu Gly His Glu Val Lys Arg Leu Arg Phe Asp Lys 

320 325 330 

Glu Gly Glu Val Arg Glu Thr Ala Ser Gin Thr Thr Ser Ser Glu 

335 340 345 

He Ser Ser Val Met Val Gly Glu Thr Glu Ala Ser Ser Ser Ser 

350 355 360 

Gin Asp Lys Asp Lys Asp Ser Arg Cys Thr Arg Gin His Cys Thr 

365 370^ 375 

Glu Glu Asp Glu Glu Glu Asp Glu Glu Glu Glu Glu Glu Ser Phe 

380 385 390 

Met Thr Ser Arg Glu Met He Pro Glu Arg Lys Asn Gin Glu Lys 

395 400 405 

Glu Ser Asp Asp Ala Leu Thr Val Asn Glu Glu Thr Ser Glu Glu 
' 410 415 420 

Asn Asn Gin Met Glu Glu Ser Asp Val Ser Gin Ala Glu Lys Asp 

425 430 435 

Leu Leu His Ser Glu Gly Ser Glu Asn Glu Gly Pro Val Ser Ser 

440 445 450 

Ser Ser Ser Asp Cys Arg Glu Thr Glu Glu Leu Val Gly Ser Asn 

455 460 465 

Ser Ser Lys Thr Gly Glu Xle Leu Ser Glu Ser Ser Met Glu Asn 

470 475 480 

Asp Asp Glu Ala Thr Glu Val Thr Asp Glu Pro Met Glu Gin Asp 
485 490 495 



<210> 2 

<211> 2453 

<212> DNA 

<213> Homo sapiens 

<220> 

,<221> misc_feature 

'<223> Incyte ID No: 7526180CB1 

<400> 2 

actgtacaaa tgctttattt ctattcaata 
caatagcatg gtggcagatg aacatcagga 
cctcaccatg gatacgcttg tgatcaaggg 
tcagaggcca caccatccta gcagtggagc 
cacctgctgc atcaccaggc tgggtgaacg 
gtatcagcac cgaaacctca caggaaaaat 
actctcagat taatctctgt catccatcgc 



tttagaagac agttataaac aagatgcatt 60 
aggaacatcc atgagcttcc atccacggaa 120 
cctggtctcc cctcaagaca cggtcacaga 180 
agtaccagct gggacagggt ccttctgtga 240 
gacacaattg ccagaactca cagaatagaa 300 
ggtaagttct aagtttctcc attaatagta 360 
ttctccaaga aatgactttt tagggtgatg 420 
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tgccaggcgc catgttggag ggctggtggt 
tcttgctctc tcgcacgctt cccccggctc 
gccggagtgt gtgcgaggga gggggagggc 
cccaagagac ccgcggaggg aggcggaggc 
gagaggctcc aggaggcgct gaaagatttt 
gtcctggatc agtttctttg tcatgtagcc 
caatttaaag gctattttat tttcaaactg 
gctcctgagc caagaggtcc tcccaaccc^. 
aaggaaagaa tactgaaaat tgtcactgga 
ctatgtgaat tgttaacaga tccaaggaga 
ggagtagaaa agaatgtgat ggttgttagc 
tccaatagtt taaatcgaat gaatggtgtt 
gagaggtcta atataaatgg gcctgggaca 
ttgtcagccc ccatgacaac aaatgggttg 
ttgcagcaaa atgaggagaa aaatcacagt 
tcagtgagcc ctttgaaaaa taaacatcca 
gaggtaaaaa gactcaggtt tgacaaagaa 
acttccagcg aaatttcttc agttatggta 
gataaagaca aagatagccg ttgtacccgg 
gatgaagagg aagaagaaga gtcttttatg 
aatcaagaaa aagaatctga tgatgcctta 
aatcaaatgg aggaatctga tgtgtctcaa 
agtgaaaacg aaggccctgt aagtagtagt 
gtaggatcca attccagtaa aactggagag 
gacgaagcca cagaagtcac cgatgaacca 
tagatgcagt attttacata cagttctggt 
aaatggacct ttagttttac aagagaagca 
tcctgaaaga gttgtacatg taagaactgt 
ttaccgctga cttttctttc tttctttttt 
gatttttttt ttttttaatt taggattgaa 
gaaaagtttt ttgagattat cacatttagt 
tctctttctg atagctctag cagttttcat 
gtgaattata gggtttcatg ctggtttcca 
ctttaagtca batatacata catattatat 



asrcggcttgg ggaggtgctc gctctgtcgg 480 
ccttcgtttc ccccccccgg tcgcctgcgt 540 
gtcggggggg tggggggagg cgttccggtc 600 
tgtgagggac tccgggaagc catggacgtc 660 
gagaagaggg ggaaaaagga agtttgtcct 720 
aagactggag aaacaatgat tcagtggtcc 780 
gagaaagtga tggatgattt cagaacttca 840 
aatgtcgaat atattccctt tgatgaaatg 900 
tttaatggta tcccttttac tattcagcga 960 
aactatacag gaacagacaa atttctcaga 1020 
tgtgtttatc cttcttcaga gaaaaacaat 1080 
atgtttcctg gaaattcacc aagctatact 1140 
cccaggccac ttaatcgacc aaaggtttct 1200 
cctgagagca cagacagcaa agaggcaaat 1260 
gactcttcga cctctgaatc agaagtttcc 1320 
gatgaagatg ctgtggaagc tgaggggcat 1380 
ggtgaagtca gagaaacagc cagtcaaacg 1440 
ggagaaacag aagcatcatc ttcatctcag 1500 
cagcactgta cagaagagga tgaagaagag 1560 
acatcaagag aaatgatccc agaaagaaaa 1620 
actgtgaatg aagagacttc tgaggaaaat 1680 
gctgagaaag atttgctaca ttctgaaggt: 1740 
tcttctgact gccgtgaaac agaagaatta 1800 
attctttcag aatcatccat ggaaaatgat 1860 
atggaacaag actaactatt tagaaacatt 1920 
tttaacactg taCaaaactt ttgtgtaata 1980 
ggttgtaaaa teiaagtactt tatggataat 2040 
gaatatcagc tcctctgggt cctgcttacc 2100 
tggtctgggc aaatcagtgg tttgtgtata 2160 
gtttttaaac tggaaggtaa ttacaatttt 2220 
ttatacatat gcaagaagct ttttgtcttg 2280 
attttggtca tagtttcaac attttaacat 2340 
gattttattg tttggctacg tacaatggaa 2400 
atatataagg gggggggggg ggt 2453 
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Table 4 



Polynucleotide 

SEQIDNO:/ 

IncytelD/ 

Sequence LengA 


Selected 
Fragments 


Sequence Fragments 


5' Position 


3* Fbsiticin 


2/7526180CB1/ 
2453 


1-637,2344- 

2453,2111- 

2131 


g3846311 


1 


224 






g4525001 


I 


361 






8245726H1 (BONEUNROl) 


I 


376 






g7150771. 


I 


409 






g4899351 


I 


416 






g5437649 


I 


416 






g4970976 


I 


432 






g4148045 


1 


437 






g4568670 


1 


443 






7101420H1 (BRAWrDR02) . 


1 


445 






70354105D1 


1 


455 






g7 152801 


1 


457 






6716356H1 (CX5N1>TUT02^ 




483 






NNL019853 


I 


2049 






6756555J1 (SINTFER02) 


2 


193 






g6464275 


2 


424 






6355618F6 (LUNGDIS03) 


2 


464 






5992740F6 (FTUBTUT()2) 


2 


504 






g4136972 


3 


358 






g5392021 


3 


419 






6938693F6 (FTUBTUROl) 


3 


518 






70744891V1 


4 


190 






g5548017 


4 


280 






2704364T6 (PONSAZTOl) 


7 


860 






2254891T6 (OVARTUTOl) 


10 


561 






3187920T6 (THYMNON04) 


10 


579 






9770147U1 


10 


637 






glOQ5259 


14 


445 






2645122T6 (OVARTUTCi3) 


17 


509 






677877 1R8 (OVARDIROl) 


34 


331 




i 


677877 IJl (OVARDIROl) 


34 


331 






90088628R6 


37 


732 






90088528J1 


37 


762 






90088512J1 


37 


782 






90088544H1 


37 


784 






90088604H1 


37 


896 






90088520J1 


39 


695 






900886 12J1 


39 


726 






90088504J1 


39 


758 






90088636J1 


39 


782 






90088628J1 


39 


814 






90088620J1 


39 


880 






9599833U1 


40 


808 






2013516T6 (TESTNOT03) 


42 


572 






9581833XJ3 


53 


804 
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Table 4 



Polynucleotide 
SEQIDN07 
IncytelD/ 
Sequence Length 


Selected 
Fragments 


Sequence Fragments 

* 


5' Position 


3* Position 


* 




5992740T6 (FTUBTUT02) 


71 


320 






4520106F6 (SINJNOT03) 


71 


699 






g2620015 


84 


293 






3187920R6 aHYMNON04) 


85 


614 






70352268D1 


107 


526 






2013516R6 (TESTNOT03) 


120 


572 






70361157D1 


131 


748 






436901 IHi (THYMNOTin 


141 


420 






703503 19D1 


151 


509 






70362149D1 


154 


525 






9599833U3 


155 


806 






6355618H1 (LUNGDIS03) 


182 


475 






703525(»D1 


182 


666 






70361333D1 


186 


525 






70361647D1 


188 


512 






70362334D1 


188 


525 






70352145D1 


188 


733 






599274QH1 (FTUBTUT02) 


203 


504 






70422872D1 


219 


504 






70361701D1 


219 


525 






S6038863H1 


233 


518 






56038863J1 


233 


630 






gl0032608 


242 


331 






6549750H1 (BRAFNON02> 


248 


699 






70422650D1 


259 


444 






70351638D1 


260 


614 
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g3645529 
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